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Preface 



In recent years, computational materials science has clearly emerged as an im- 
portant field of condensed matter physics. In particular, the development of new 
computing facilities has made it possible to study physical phenomena at the 
atomic scale by means of ab initio electronic structure methods. Among various 
approaches used, the Linear Muffin-Tin Orbitals method (LMTO) proposed in 
the seventies by O.K. Andersen has played a key role. In its Atomic Sphere Ap- 
proximation (ASA), the LMTO method has been widely used to tackle various 
type of problems. In 1984, O.K. Anderson and coworkers introduced a localized 
LMTO basis set. This new approach, called Tight-Binding LMTO (TB-LMTO), 
has paved the way to an order-A^ scheme, giving new impetus to the study of 
numerous physical properties of systems with large number of atoms. 

This book is based on selected contributions presented at a workshop, orga- 
nized in October 1998 in the monastery of Mont Saint Odile (near Strasbourg, 
France) . A large number of scientists involved in the development and the prac- 
tice of the LMTO method gathered there for three days. The first part of this 
book is devoted to the formalisms for ground and excited states. It starts with 
a review, by Andersen and coworkers, of the TB-LMTO method and its genera- 
lization. The Schrodinger equation of A^th order in the energy expansion for an 
overlapping muffin-tin potential is solved using a minimal basis set. The aim of 
this third generation LMTO method is to take a further step beyond the limi- 
tations of the popular atomic-sphere approximation. The present approach uses 
wave functions which are accurate not only in the muffin-tin spheres but also in 
the interstitial region. In the conventional implementations of the LMTO-ASA 
method it is difficult to determine the forces on the atoms. For this reason Full- 
Potential LMTO approaches have been developed. In this book two different 
approaches are described. The first one is proposed by Methfessel and coworkers 
and is based on smooth Hankel functions. The use of these special functions 
allows the method to provides a good accuracy of the total energy with an al- 
most minimal basis set. The second approach proposed by Wills and coworkers 
uses a large basis set whieh ean describe multiple principal quantum numbers 
within a single fully hybridized basis set. This large basis set allows this method 
to determine the exeited states to higher energies without the need for cumber- 
some multiple energy panel calculations like in the minimal basis set methods. 
However, the drawback and limitation of these FP methods remains the huge 
computational effort needed, which inhibits calculations of systems with large 
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numbers of atoms. To overcome these computational hurdles, Kollar and cowor- 
kers presented the so called full-density method, a method halfway between the 
ASA and the FP, which combines the simplicity of the former and the accuracy 
of the latter. In the initial description of the Density Functional Theory (DFT), 
the eigenvalues are Lagrange multipliers, and thus no physical meaning can be 
associated with them. However the success of excited states calculations based 
on DFT can be understood due to the fact that the Kohn-Sham equations can 
be viewed as an approximation to the quasi-particle equations where the self- 
energy is local and time independent. Alouani and Wills give the basics and 
some applications to the determination of optical properties and x-ray magnetic 
dichroism. 

One of the main success of the LMTO schemes in the last few years has 
been the description of magnetic systems. A fully relativistic formalism and the 
applications to spectroscopy are presented by Ebert. The key question of the 
magnetic anisotropy of bulk and thin films is addressed by Eriksson and Wills, 
illustrating the high level of precision reached, whereas Temmerman and co- 
workers present a unified formalism to describe localized and delocalized states, 
pointing out the importance of the self-interaction correction. Another spectacu- 
lar use of the TB-LMTO method is given by Kudrnovsky and eoworkers on the 
Interlayer Exchange Coupling (lEC). Ab initio formulations of the lEC between 
non-collinearly aligned magnetic slabs lead to results in good agreement with 
experiment. 

One advantage of the TB-LMTO is its use to describe disordered systems such 
as alloys with the precision of ab initio methods. A short review of the TB-LMTO 
within the Coherent Potential Approximation (CPA) applied to disordered alloys 
and surfaces is given by Turek and coworkers, whereas Abrikosov and coworkers 
present a locally self-consistent Green’s function method. This latter order-A^ 
method is particularly interesting for systems with a large number of inequivalent 
atoms. 

Mathematical and numerical problems are the building blocks of ab initio 
methods, and numerical algorithms for solving various parts of the formalism are 
of great interest. In particular, the diagonalisation of sparse matrices, which is at 
the heart of the TB-LMTO method, by efficient algorithms is highly desirable. In 
this respect, Scott has given an introduction to direct methods for the solution 
of large-scale linear systems, emphasizing the progress made in the development 
of routines which are now available in numerical libraries. The book ends with 
two contributions on the determination of the electronic structure in real space: 
a real-space derivation of the TB-LMTO method by Spisak and Hafner, and the 
venerable semi-empirical tight-binding method by Cornea and StoefHer. 

I would like to thank each of the sixty-seven scientists from seventeen different 
countries who make this book possible, especially those who came from far away 
countries, like Argentina, Brasil, India, and Japan. The meeting was held over 
three days in a lively atmosphere where the most recent progress in the ab initio 
methods was presented and where discussions continued well after the talks. 




Preface VII 



Besides the fourteen invited talks which form the basis of this book, nineteen 
shorter presentations offered the opportunity to focus on more precise points. 

The “Ab-initio Calculations of Magnetic Properties of Surfaces, Interfaces, 
and Multilayers” TMR European network provided the first impulse for this 
meeting. The support of the European Science Foundation (ESF) through the 
program 'P k allowed the participation of a large number of scientists. Finally, the 
local support of the IPCMS (Institut de Physique ct Chimic de Strasbourg) was 
greatly appreciated. I would also like to thank M. Alouani for constant interest 
and I. Galanakis, who struggled and succeeded in finalizing the electronic version 
of the book. 

Strasbourg, August 1999 Hugues Dreysse 
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Abstract. The TB-LMTO-ASA method is reviewed and generalized to an accurate 
and robust TB-NMTO minimal-basis method, which solves Schrodinger’s equation to 
Ath order in the energy expansion for an overlapping MT-potential, and which may 
include any degree of downfolding. For A = 1, the simple TB-LMTO-ASA formalism is 
preserved. For a discrete energy mesh, the NMTO basis set may be given as: (i") = 

^ (£n, r) in terms of kinked partial waves, <^(e, r), evaluated on the mesh, 
eo,...,eN- This basis solves Schrodinger’s equation for the MT-potential to within an 
error ex (e — so) ... (e — £iv) . The Lagrange matrix-coefficients, as well as the 

Hamiltonian and overlap matrices for the NMTO set, have simple expressions in terms 
of energy derivatives on the mesh of the Green matrix, defined as the inverse of the 
screened KKR matrix. The variationally determined single-electron energies have errors 
cx (e — £o)^ ••• (e — £ n )^ ■ A method for obtaining orthonormal NMTO sets is given and 
several applications are presented. 



1 Overview 

Muffin-tin orbitals (MTOs) have been used for a long time in ab initio calcu- 
lations of the electronic structure of condensed matter. Over the years, several 
MTO-based methods have been devised and further developed. The ultimate aim 
is to hnd a generally applicable electronic-structure method which is accurate 
and robust, as well as intelligible. 

In order to be intelligible, such a method must employ a small, single-electron 
basis of atom-centered, short-ranged orbitals. Moreover, the single-electron Ha- 
miltonian must have a simple, analytical form, which relates to a two-center, 
orthogonal, tight-binding (TB) Hamiltonian. 

In this sense, the conventional linear muffin-tin-orbitals method in the atomic- 
spheres approximation (LMTO-ASA) [1,2] is intelligible, because the orbital may 
be expressed as: 

XRl{^r) = 4>Rl{^r) + "^iiR'L' {^R'){Hr'L',RL - £v5r'r5ul) ■ ( 1 ) 

R'L' 



Here, (f>RL {xr) is the solution, pRi [e,,,rR)Yi^ [vr) , at a chosen energy, £,,, of 
Schrodinger’s differential equation inside the atomic sphere at site R for the 
single-particle potential, '^r vr (rR) , assumed to be spherically symmetric in- 
side that sphere. Moreover, = r — R and L = Im. The function tpRi (e,r) 

H. Dreysse (Ed.): Workshop 1998, LNP 535, pp. 3-84, 1999. 

© Springer-Verlag Berlin Heidelberg 1999 
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thus satisfies the one-dimensional, radial Schrodinger equation 



^r^Ri (s,r) = - 



£ -Vr (r) - 



1(1 + 1 ) 



ripRi (£,r) ■ 



( 2 ) 



In (1), 4>rl(v) are the energy-derivative functions, dipRi(e,r) /de\^ Fim (?) ■ 
The radial functions, (p and p, and also the potential, v, are truncated outside 
their own atomic sphere of radius s, and the matrix, H, is constructed in such a 
way that the LMTO is continuous and differentiable in all space. Equation (1) 
therefore expresses the LMTO at site R and (pseudo) angular momentum L as 
the solution of Schrodinger’s equation at that site, with that angular momentum, 
and at the chosen energy, plus a ’smoothing cloud’ of energy-derivative functions, 
centered mainly at the neighboring sites, and having around these, all possible 
angular momenta. 

That a set of energy-mdependent orbitals must have the form (1) in order to 
constitute a basis for the solutions 'Pi (r) - with energies in the neighborhood 
of - of Schrodinger’s equation for the entire system, is intuitively obvious, be- 
cause the corresponding linear combinations, J2rl K-RL (*'^f) £RL,i, will be those 
which locally, inside each atomic sphere and for each angular momentum, have 
the right amount of ^ - provided mainly by the tails of the neighboring orbitals 
- added onto the central orbital’s p. Since by construction each pm (e, r) is the 
correct solution, this right amount is of course Si —Sn. In math: since definitions 
can be made such that the expansion matrix Hr'l' rr is Hermitian, its eigen- 
vectors are the coefficients of the proper linear combinations, and its eigenvalues 
are the energies: 

"^Xrl ('Tr) CRL,i = ^ \^RL (rn) + (si - s^) ^rl (rfl)j crlp 
RL RL 

» 'Y^(j)RL(£i,'CR)cRL,i = Piir). ( 3 ) 

RL 

Hence, H is a. Ist-order Hamiltonian, delivering energies and wave functions with 
errors proportional to (si — , to leading order. 

First-order energies seldom suffice, and in the conventional LMTO- ASA me- 
thod use is made of the variational principle for the Hamiltonian, 

n ^ + Y.^vr{tr) , ( 4 ) 

so that errors of order (si — s„)'^ in the basis set merely give rise to errors of order 
(si — £^)^ in the energies. With that approach, the energies and eigenvectors are 
obtained as solutions of the generalized eigenvalue problem: 

X] liXR'L' \H - Si^IXRl) - (£i - £u) iXR'L' I XRl)] CRL,i = 0 , ( 5 ) 

RL 

for all R' L' . If we now insert (1) in (5), we see that the Hamiltonian and over- 
lap matrices are expressed in terms of the Ist-order Hamiltonian, H, plus two 
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diagonal matrices with the respective elements 

(^(pRL I ‘Prl'J = Jo^PRi {r) ‘fm (r) r^dr, (^(pRL \ ^rl^ = ‘Pri {rf r'^dr. (6) 

These matrices are diagonal by virtue of the ASA, which approximates integrals 
over space by the sum of integrals over atomic spheres. If each partial wave is 
normalized to unity in its sphere: Jq Pri (r)^ r^dr = 1, then ((/> | tp) is the unit 
matrix in the ASA, and the Hamiltonian and overlap matrices entering (5) take 
the simple forms: 

(x \n - e.l x) = {H- £,) [l + {4>\ p) {H - £,)] (7) 

(x I x) = ^ + {H -e^)(p\(p^ 1 + (0 j {H - e^) 

+ {H-e,) (0 I I 0)' {H-e,). 

Here and in the following we use a vector-matrix notation according to which, 
for example xrl (rfi) and xrl (rij)* are considered components of respectively 
a row-vector, x (r) , and a column- vector, x (r)^ • The eigenvector, Cj, is a column 
vector with components crr^i. Moreover, 1 is the unit matrix, e^, is a diagonal 
matrix, and id is a Hermitian matrix. Vectors and diagonal matrices are denoted 
by lower-case Latin and Greek characters, and matrices by upper-case Latin 
characters. Exceptions to this rule are: Y (f) , the vector of spherical harmonics, 
the site and angular-momentum indices (subscripts) ii, L, /, and A, and the 
orders (superscripts) L, M, and N. Operators are given in calligraphic, like 7i, 
and an omitted energy argument means that e = 

With the 0(r)’s being orthonormal in the ASA, the LMTO overlap matrix 
in (7) is seen to factorize to 1st order, and it is therefore simple to transform to 
a set of nearly orthonormal LMTOs: 

X(r) = X (r) [1 + (^ I 0) - £1^)] ( 8 ) 

(xI'W-e./lx) = = [1 + (id - e^) I {H - e^) 

= H-e, ~ {H -e,)(p\<i>){H -e,) + ... 

(x I X> = 1 + (0 I 0) ( 

0(r) = 0(r) - 0(r) (p \ . 



id -e 
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Here, the energy-derivative function, (j) (r) , equals (j) (r) , orthogonalized to (j) (r) . 
Finally, we may transform to a set of orthonormal LMTOs: 



X(r) = x(r) 



1 + { H — 



H -e, 



1-- H 
2 






(9) 






{x\n-e,\tl ^ H-e, = H-s,- 

\{h - j 



[ji — 



We thus realize that of the Hamiltonians considered, H is of 1st, H is of 2nd, and 
H is of 3rd order. As the order increases, and the energy window - inside which 
the eigenvalues of the Hamiltonian are useful as single-electron energies - widens, 
the real-space range of the Hamiltonian increases. For real-space calculations [3- 
7], it is therefore important to be able to express a higher-order Hamiltonian as 
a power series in a lower-order Hamiltonian like in (8) and (9), because such a 
series may be truncated when the energy window is sufficiently wide. 

The energy-derivative of the radial function ip (e, r) depends on the energy de- 
rivative of its normalization. If we choose to normalize according to: ip (e, r) r^dr 

1, then it follows that ip (r) ip (r) r^dr = 0. Choosing another energy-dependent 
normalization: ip (e, r) = ip (e, r) [1 + (e — £j^) o] , specified by a constant o, then 
we see that: ip(r) = (p (r) -f- ip (r) o. Changing the energy derivative of the norma- 
lization thus adds some ip (r) to ip (r) and thereby changes the shape of the ’tail 
function’ (p (r) . Since all LMTOs (1) should remain smooth upon this change, 
also H must change, and so must all LMTOs in the set. The diagonal matrix 

I , whose elements are the radial overlap integrals: o = ip (r) ip (r) r^dr, 
thus determines the LMTO representation, and the first and the last equations 
(8) specify the linear transformation between representations. Values of the dia- 
gonal matrix l^(j) \ ^ exist, which yield short range for the Ist-order Hamiltonian 

H and, hence, for the LMTO set (1). Such an H is therefore a two-center TB 
Hamiltonian and such an LMTO set is a first-principles TB basis. 

In order to obtain an explicit expression for H, one needs to find the spherical- 
harmonics expansions about the various sites for a set of smooth MTO envelope 
functions. For a MT-potential, which is flat in the interstitial, the envelope fun- 
ctions are wave-equation solutions with pure spherical-harmonics character near 
the sites. Consistent with the idea behind the ASA - to use ’space-filling spheres’ 

- is the use of envelope functions with fixed energy, specifically zero, which is a 
reasonable approximation for the kinetic energy between the atoms for a valence 
state. The envelope functions in the ASA arc thus screened multipole potentials, 
with the screening specified by a diagonal matrix of screening constants, am, 
related to the radial overlaps om- The expansion of a hare multipole potential 




Developing the MTO Formalism 



7 



at site R about a different site R' is well known: 






R'L' 



Yl„m" (R^) 
|R'-R|'"+^ 



~ ^2 (^R') ^%l',rl ■ 

R'L' 



Here, I" = I' + I and m" = m' — rn. With suitable normalizations, the hare 
structure matrix, S°, can be made Hermitian. The screened structure matrix is 
now related to the bare one through a Dyson equation: 

(S“)-' = (S°)-'- a, (10) 

which may be solved by inversion of the matrix S° — This inversion may be 
performed in real space, that is in R- rather than in k-representation, provided 
that the screening constants take values known from experience to give a short- 
ranged S“. 

In the end, it turns out that all ingredients to the LMTO Hamiltonian and 
overlap integrals, H, (^(p \ (pj , and | <^^ , may be obtained from the screened 
Korringa-Kohn-Rostoker (KKR) matrix in the ASA: 



^R'L',RL = PrI (^) Sr'rSl'L ~ '^R'L'ML- (H) 

Here, p° (e) is a diagonal matrix of potential functions obtained from the radial 
logarithmic derivative functions, 9{(^(e, s)} = 91n |(/? (e, r)| /91nr|^ , evaluated 
at the MT-radius, and p“ (e) is related to p*^ (e) via the diagonal version of 
Equation (10). The results are: 



H = e,,-K 



- PP 



P "Sp^ 



c + dsSds , 



, I , \ K Ip 



K 

3! 



3! p ’ 



( 12 ) 



expressed in terms of the KKR matrix, renormalized to have AT = 1 : 
K (e) = k“5K(e)k“5 = p (e) p“^ — p“5S p-s . 



(13) 



This corresponds to the partial-wave normalization: p (r)^ r^dr = 1, and K (e) 

is what in the 2 nd-generation method [ 1 , 2 ] is denoted —h (ff) , but since the cur- 
rent notation identifies matrices by capitals, we cannot use h. The LMTO Hamil- 
tonian and overlap matrices are thus expressed solely in terms of the structure 
matrix S and the potential functions p (e) , specifically the diagonal matrices p, 
p, p, and P . It may be realized that the nearly-orthonormal representation is 
generated if the diagonal screening matrix in ( 10 ) is set to the value 7 , which 
makes p'’' vanish. 

For calculations [8-10] which employ the coherent-potential approximation 
(CPA) to treat substitutional disorder, it is important to be able to perform 
screening transformations of the Green matrix: 

G“ ( 2 ) = K“ [z)~^ = [p“ {z) - S“]"^ , 



(14) 
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also called the resolvent, or the scattering path operator in multiple scattering 
theory [11]. In the 2nd generation MTO forinalisin, G“ (e) was denoted g°^ (e) . 
This screening transformation is: 



G^(^) 



{/3 - a) 



P“ {z) 
pd (z) 



P° jz) 
pd (z) 



G“(^) 



P° (z) 
pd (z) ’ 



(15) 



and is seen to involve no matrix multiplications, but merely energy-dependent 
rescaling of matrix elements. As a transformation between the nearly orthonor- 
mal, /3=7, and the short-ranged TB-representation, Eq. (15) has been useful 
also in Green-function calculations for extended defects, surfaces, and interfa- 
ces [8,10,12-14]. However, calculations which start out from the unperturbed 
Green matrices most natural for the problem - namely those obtained from 
LMTO band-structure calculations in the nearly orthonornial representation for 
the bulk systems - have usually been limited to 2nd-order in z — e,j, because 
{z) is linear to this order, and because attempts to use 3rd-order expressions 

for p’’' (z) employing the potential parameter p"*^ = 3! p’’' \ , induced false 

poles in the Green matrix. 

What is not intelligible in the TB-LMTO-ASA method is that the LMTO ex- 
pansion (1) must include all L'’s until convergence is reached throughout each 
sphere, and all R”s until space is covered with spheres. This means that the 
LMTO-ASA basis is minimal - at most - for elemental, closely packed transi- 
tion metals, the case for which it was in fact invented [15]. The supreme com- 
putational efficiency of the method soon made self-consistent density- functional 
[16] calculations possible, and not only for elemental transition metals, but also 
for compounds. In order to treat open structures such as diamond, empty sphe- 
res were introduced as a device for describing the repulsive potentials in the 
interstices [17]. All of this then, led to misinterpretations of the wave-function 
related output of such calculations in terms of the components of the one-center 
expansions (1), typically the numbers of ,s, p, and d electrons on the various 
atoms (including in the empty spheres!) and the charge transfers between them. 
Absurd statements to the effect that CsCl is basically a neutral compound with 
the Cs electron having a bit of s-, more p-, quite some d-, and a bit of /-character 
were not uncommon. Many practitioners of the ASA method did not realize that 
the role of the MT-spheres is to describe the input potential, rather than the ou- 
tput wave- functions. For the latter, the one-center expansions truncated outside 
the spheres constitute merely a decomposition which is used in the code for 
sclfeonsistcnt calculations. The strange Gs electron is therefore little more than 
the expansion about the Cs site of the tails of the neighboring Cl p electrons 
spilling into the Cs sphere. That latter MT-sphere must of course be chosen to 
have about the same size as that of Cl, because only then is the shape of the 
Cs+Cl” potential in the bi-partitioned structure well described. 

Now, the so-called high partial waves - they are those which are shaped like 
r* in the outer part of the sphere where the potential flattens out - do enter 
the LMTO expansion (1), but not the eigenvalue problem (5) or the equivalent 
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KKR equation: 

K{ei)ci = 0, (16) 

because they are part of the MTO envelope functions. This property of having 
the high-^ limit correct is a strength of the MTO method, not shared by for 
instance Gaussian orbitals, which are solutions of (2) for a parabolic potential. 
There are, however, also other partial waves - like the Cs s-waves, d-waves 
in non-transition metal atoms, /-waves in transition-metal atoms, s-waves in 
oxygen and fluorine, and in positive alkaline ions, and all partial waves in empty 
spheres - which for the problem at hand arc judged to be inactive and should 
therefore not have corresponding LMTOs in the basis. In order to get rid of 
such inactive LMTOs, one must first - by means of (10) or (15) - transform to 
a representation in which the inactive partial waves appear only in the ’tails’ 
(second term of (1)) of the remaining LMTOs; only thereafter, the inactive 
LMTOs can be deleted. This down-folding procedure works for the LMTO-ASA 
method, but it messes up the connection between the LMTO Hamiltonian (7)- 
(13) and the KKR Green-function formalisms (12)-(16), and it is not as efficient 
as one would have liked it to be [2]. E.g., the Si valence band cannot be described 
with an sp LMTO basis set derived by down-folding of the Si d- as well as all 
empty-sphere partial waves [18]. 

The basic reason for these failures is that the ASA envelopes are chosen to 
be independent of energy - in order to avoid energy dependence of the structure 
matrix - because this is what forces us to carry out explicitly the integrals 
involving all partial waves in all spheres throughout space. What should be done 
is to include all inactive waves, (e,r) , in energy-dependent yVIO-envelopes, 
and then to linearize these MTOs to form LMTOs. This has been achieved with 
the development of the LMTO method of the 3rd- generation [19,20], and will be 
dealt with in the present paper. The reason why energy linearization still works 
in a window of useful width, now that the energy dependence is kept throughout 
space, is due to the screening of the wave-equation solutions used as envelope 
functions [21]. 

As an extreme example, it was demonstrated in Fig. 7 of Ref. [20] - and 
we shall present further results in Fig. 11 below - how with this method one 
may pick the orbital of one band, with a particular local symmetry and energy 
range, out of a complex of overlapping bands. This goes beyond the construc- 
tion of a Wannier function and has relevance for the treatment of correlated 
electrons in narrow bands [22,23]. Another example to be treated in the present 
paper is the valence and low-lying conduction-band structure of GaAs calculated 
with the minimal Ga spd As sp basis [24]. Other examples, not treated in this 
paper, concern the calculation of chemical indicators, such as the crystal-orbital- 
overlap-projected densities of states (COOPs) [25] for describing chemical pair 
bonding. These indicators were originally developed for the empirical Hiickel me- 
thod where all parameters have been standardized. When one tries to take this 
over to an ah initio method, one immediately gets confronted with the problems 
of representation. For instance, COOPs will vanish in a basis of orthonormal or- 
bitals. Therefore, the COOPs first had to be substituted by COHPs, which are 
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Hamiltonian- rather than overlap projections, but still, the LMTO-ASA method 
often gave strange results - for the above mentioned reasons [26]. What one has 
to do is - through downfolding - to chose the chemically- correct LMTO Hilbert 
space and - through screening - choose the chemically correct axes (orbitals) in 
this space. Only with such orbitals, does it make sense to compute indicators 
[27,28]. 

A current criterion for an electronic-structure method to be accurate and 
robust is that it can be used in ab initio density-functional molecular-dynamics 
(DF-MD) calculations [29]. According to this criterion, hardly any existing LMTO 
method - and the LMTO-ASA least of all - is accurate and robust. 

Most LMTO calculations include non-ASA corrections to the Hamiltonian 
and overlap matrices, such as the combined correction for the neglected integrals 
over the interstitial region and the neglected high partial waves. This brings in 
the first energy derivative of the structure matrix, S, in a way which makes the 
formalism clumsy [2]. The code [30] for the 2nd-generation LMTO method is 
useful [31] and quite accurate for calculating energy bands, because it includes 
downfolding in addition to the combined correction, as well as an automatic way 
of dividing space into MT-spheres, but the underlying formalism is complicated. 

There certainly are LMTO methods sufficiently accurate to provide structu- 
ral energies and forces within density-functional theory [8,9,34-36,7,38-40], but 
their basis functions are defined with respect to MT-potentials which do not 
overlap. As a consequence, in order to describe adequately the correspondingly 
large interstitial region, these LMTO sets must include extra degrees of freedom, 
such as LMTOs centered at interstitial sites and LMTOs with more than one 
radial quantum number. The latter include LMTOs with tails of different kine- 
tic energies (multiple kappa -sets) and LMTOs for semi-core states. Moreover, 
these methods usually do not employ short-ranged representations. Finally, since 
a non-overlapping MT potential is a poor approximation to the self-consistent 
potential, these methods are forced to include the matrix elements of the full 
potential. Existing full-potential methods are thus set up to provide final, nu- 
merical results at relatively low cost, but since they are complicated, they have 
sofar lacked the robustness needed for DF-MD, and their formalisms provide 
little insight to the physics and chemistry of the problem. 

One of the early full-potential MTO methods did fold down extra orbitals 
and furthermore contained a scheme by which the matrix elements of the full 
potential could be efficiently approximated by integrals in overlapping spheres 
[38]. The formalism however remained complicated, and the method apparently 
never took off. A decade later, it was shown [21,20] that the MT-potential, which 
defines the MTOs - and to which the Hamiltonian (4) refers - may in fact have 
some overlap: If one solves the exact KKR equations [41] with phase shifts calcu- 
lated for MT-wells which overlap, then the resulting wave function is the one for 
the superposition of these MT-wells, plus an error of 2nd order in the potential- 
overlap. This proof will be repeated in Eq. (28) of the present paper, and in 
Figs. 14 and 13 we shall supplement the demonstration in Ref. [20] that this 
may be exploited to make the kind of extra LMTOs mentioned above super- 
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fluous, provided that the MTO-envelopes have the proper energy dependence, 
that is, provided that 3rd generation LMTOs are used. Presently we can handle 
MT-potentials with up to ~60% radial overlap (s^ + sj^r < 1.6 |R — and 
it seems as if such potentials, with the MT-wells centered exclusively on the 
atoms, are sufficiently realistic that we only need the minimal LMTO set de- 
fined therefrom [20,42]. It may even be that such fat MT-potentials, without 
full-potential corrections to the Hamiltonian matrix, will yield output charge 
densities which, when used in connection with the Hohenberg-Kohn variational 
principle for the total energy [16], will yield good structural energies [43]. Hence, 
we are getting rid of one of the major obstacles to LMTO DF-MD calculations, 
the empty spheres. 

Soon after the development of the TB-LMTO-ASA method, it was realized 
[44] that the full charge density produced with this method - for cases where 
atomic and interstitial MT-spheres fill space well - is so accurate, that it should 
suffice for the calculation of total energies, provided that this charge density is 
used in connection with a variational principle. However, it took ten years before 
the first successful implementation was published [45]. The problem is as follows: 
The charge density, p (r) = \'^i (^)l i most simply obtained in the form of 

one-center expansions: 

/■ 

<pRL {z,Yr) lm.GRL,RL' {z) (pRL' {z,rR)* — , (17) 

R LL' ^ 



where G {z) = K (z) ^ , as can be seen from (1) and (3), but these expansions 
have terribly bad L-convergence in the region between the atoms and cannot 
even be used to plot the charge-density in that region. That was made possible 
by the transformation to a short-ranged representation, because one could now 
use: 



P (r) = E E 

RL R'L' 



/ ^^Grl,r'L' {z) 

J occ 



dz 

7T 



XR'L' (ffi')* I 



(18) 



where the L-sums only run over active values, and where the double-sum over 
sites converges fast. Nevertheless, to compute a value of xrl (r) with r far away 
from a site, one must evaluate the LMTO envelope function, which is a superpo- 
sition of the bare ones, Yr (tr) , and this means that (18) actually contains 
a 4-double summation over sites. At that time, this appeared to make the eva- 
luation of p (r) at a sufficient number of interstitial points too time-consuming 
for DF-MD, although the full charge density from (18) was used routinely for 
plotting the charge-density, the electron-localization function [46], a.s.o. In order 
to evaluate the total energy, the full charge density must also be expressed in 
a form practical for solving the Poisson equation. If one insists on a real-space 
method, then fast Fourier transformation is not an option. In Fig. 12 of the 
present paper, we shall present results of a real-space scheme [47,48] used in 
connection with 3rd-generation LMTOs for the phase diagram of Si [49]. This 
scheme is presently not a full-potential, but a full charge-density scheme, and 
the calculation of inter-atomic forces has still not been implemented. 
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With 3rd generation LMTOs [19,20], the simple ASA expressions (1)-(18) still 
hold, provided that 4> (e, r) is suitably redehncd, and that K (e) is substituted by 
the proper screened KKR matrix whose structure matrix depends on energy. The 
LMTO Hamiltonian and overlap matrices are given in terms of K, and its first 
three energy derivatives. A", K. and K, which are not diagonal. Downfolding, 
the interstitial region, and potential-overlap to first order are now all included 
in this simple ASA-like formalism [1]. In due course, we thus hope to be able 
to perform DF-MD calculations with an electronic Hamiltonian which is little 
more complicated than (7), (8), or (9). 

A hnal problem with the LMTO basis is that even with the conventional 
spd-hasis and space-filling spheres, the LMTO set is insufficient for cases where 
semi-core states and excited states must be described by one minimal basis set, 
and in one energy panel. This problem becomes even more acute in the 3rd- 
generation method where, due to the proper treatment of the interstitial region, 
the expansion energy must be global, that is, e,/ is now the unit matrix 
times rather than a diagonal matrix with elements £i,ri5rr'5ll' . The same 
problem was met when attempting to apply the formally elegant relativistic, 
spin-polarized LMTO method of Ref. [50] to narrow, spin-orbit split /-bands. 
Finally, as MT-spheres get larger, and as more partial waves are being folded 
into the MTO envelopes, the energy window inside which the LMTO basis gives 
accurate results shrinks. This means, that the 3rd-generation LMTO method 
described in [20] may not be sufficiently robust. 

The idea emerging from the LMTO construction (1) seems to be: Divide space 
into local regions inside which Schrodinger’s equation separates due to spherical 
symmetry and which are so small that the energy dependence of the radial 
functions is weak over the energy range of interest. Then expand this energy 
dependence in a Taylor series to hrst order around the energy Si, at the center 
of interest: / {e, r) ss / (r) + (e — e,,) / (r) . Finally, substitute the energy by a 
Hamiltonian to obtain the energy-mdependent LMTO. The question therefore 
arises (Fig. 1): Can we develop a more general, polynomial MTO scheme of 
degree N, which allows us to use an A^th-order Taylor series or - more generally 
- allows us to use a mesh of A^ + 1 discrete energy points, and thereby obtain 
good results over a wider energy range, without inereasing the size of the basis 
set 7 Such an NMTO scheme has recently been developed [51] and shown to be 
very powerful [24]. We shall preview it in the present paper. 

Most aspects of the 3rd-generation LMTO method have been dealt with in 
a set of lecture notes [19] and a recent review [20]. Here, we shall try to avoid 
repetition but, nevertheless, give a self-contained description of two selected 
aspects of the new method: the basic concepts and the new polynomial NMTO 
scheme, to be presented here for the first time. 

We first explain (Sect. 2) what the functions 4>{e,v) actually are in the 3rd- 
generation formalism. This we do using conventional notation in terms of sphe- 
rical Bessel functions and phase shifts - like in Ref. [21] - and only later, wc 
renormalize to the notation used in Refs. [19] and [20]. It turns out that the bare 
/’s are the energy- dependent MTOs of the 1st generation [52]. The screened (p’s 
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Fig. 1. Quadratic approximation to the energy dependence of a partial wave for a 
condensed (Taylor) and a discrete (Lagrange) mesh. 



are the screened, energy-dependent MTOs of the 2nd generation [21], with the 
proviso that = e. This proviso - together with truncations of the screening 
divergencies at the sites, inside the so-called screening spheres - is what makes 
the screened (j)’a equal to the so-called unitary [19] or kinked [20] partial waves 
in the formalism of the 3rd generation. Wo then derive the screened KKR equati- 
ons and repeat the proof from Refs. [21] and [20] that overlapping MT-potentials 
are treated correctly to leading (1st) order in the potential overlap. Towards the 
end of this first section, we introduce the so-called contracted Green function 
4> {e, r) G (e) , which will play a crucial role in the development of the polynomial 
NMTO scheme, and we derive the 3rd-geiieration version of the scaling relation 
(15) for screening the Green function. 

In Sect. 3 we show how to get rid of the energy dependence of the kinked- 
partial wave set: We hrst introduce a set of energy-dependent XMTOs, (S; ; 

which - like the (p {e, r) set - spans the solutions of Schrodinger’s equation for 
the chosen MT-potential, and whose contracted Green function, (e, r) G (e) , 
differs from (j) {e, r) G (e) by a function which is analytical in energy. Like in clas- 
sical polynomial approximations, we choose a mesh of arbitrarily spaced energies, 
£o, and subsequently adjust the analytical function in such a way that, 

X*-^^ (£o>r) = ... = X*"^^ (sAfjT) . The latter then, constitutes the set of energy- 
mdependent NMTOs. The Oth-order set, (r) , is seen to be the set of kinked 
partial waves, 0(eo,r) , at the energy Eq, and the Ist-ordcr set, x*'^^ (r) , to be 
the set of tangent or chord-LMTOs - depending on whether the mesh is conden- 
sed or discrete. For the case of a condensed mesh - which is the simplest - the 
matrices, which substitute for the energies in the Taylor series (1) - generalized 
to Nth order - turn out to be: 



(M-l) 

G 




-1 






(M - 1)! 



for 1 < M < X, 



(19) 
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in terms of the Mth and the (M — l)st energy derivatives of the Green matrix. 
Moreover, the expressions for tiie Hamiltonian and overlap matrices are: 
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( 20 ) 



which, for = 1, are easily seen to reduce to (7) upon insertion of (12). In 
retrospect, it is convenient that these basic NMTO results are expressed in terms 
of energy derivatives of the Green matrix G (e) - rather than in terms of those 
of its inverse, K (e), as we arc used to from the LMTO-ASA method (12) - 
because if we imagine generalizing (1) to A^th order and using it to form the 
Hamiltonian and overlap matrices like in (7), then each matrix will consist of 
N'^ terms, among which a number of relations can be shown to exist. We also 
realize, that the problem mentioned above about using Green matrices beyond 
2nd order in z — e,y, is solved by using - instead of G (z) — the NMTO Green 
function: 






(N) 
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(2N) 

G 

{2N)l 
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(2N + 1) 



(N) 



( 21 ) 



which equals G (z) to {2N + l)st order. This Green function has the additional 
advantage of allowing for a simple treatment of non-MT perturbations. We admit 
that this route to energy-independent MTO basis sets has little in common with 
the twisted path we cut the first time, but once found, it is easy to accept and 
understand the results - which are simple. 

In practice, it is cumbersome to differentiate a KKR matrix - not to speak of 
a Green matrix - many times with respect to energy. Hence, one uses a discrete 
energy mesh. With that, the derivatives in (19) and the pre- and post factors 
in (20) and (21) turn out to be divided differences, while those at the centers 
of (20) turn out to be the highest derivative of that approximating polynomial 
which is fitted not only to the values of G (e) at the mesh points, but also to its 
slopes. Hence, they are related to classical Hermite interpolation [53]. 

In both Sections 2 and 3, special attention is paid to the so-called triple- 
valuedness, because this was not previously explained in any detail, but has 
turned out to be crucial for the further developments and will be even more so 
when we come to evaluate the inter-atomic forces. A related aspect is the fact 
that a screening transformation in the formalism of the 3rd-generation is linear 
as regards the envelope functions, but non-linear as regards the NMTOs. This 
means, that changing the screening, changes the NMTO Hilbert space. This was 
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not the case for 2nd-generation LMTOs. This is the reason why we took care to 
denote the ncarly-orthonornial and orthonorinal LMTO sets arrived at by the 
linear transformations (8) and (9) by respectively x a-nd y, rather than by y'*' 
and y^, as in the 2nd-generation LMTO scheme, where screening transformati- 
ons were linear and denoted by superscripts. Screening transformations like (10) 
and (15) still hold for the 3rd-generation structure- and Green-matrices, but the 
partial waves providing the spatial factors of the Green function (see(17)) are 
different', they have tails extending into the interstitial region. A tail is attached 
continuously, but with a kink, at the screening sphere, which is concentric with, 
but smaller than, its own MT-sphere, and the resulting kinked partial wave, or 
Oth-order energy-dependent MTO, is - for the purpose of evaluating its pro- 
perties in a simple, approximate way - triple-valued in the shell between these 
two spheres. The radii, define the screening and determine the shape of the 
MTO envelopes. Now, for a superposition of kinked partial waves given by a so- 
lution of the KKR equations (16), the kinks and the triple- valuedness cancel, but 
for a single NMTO, a triple-valuedness of order (r — (cj — £q) ~ ^n) 

- which is the same as the error caused by the energy interpolation - remains. 
For this reason: The smaller the screening radii - i.e. the weaker the screening - 
the smaller the energy window inside which an energy-mdependent NMTO set 
gives good results. The extreme case is the bare (a — ?• 0) N = 0 set, which is 
the set of Ist-generation MTOs [52], but defined without freezing the energy de- 
pendence outside the central MT-sphere. The tail-cancellation condition for this 
set leads to the original KKR equations [41], which - we know - must be solved 
energy-by-energy, that is, the energy window can be very narrow, depending on 
the application. Specifically, for free electrons the width is zero. 

At the end of Sect. 3, wc demonstrate the power of the new NMTO methods 
by applying the differential and discrete LMTO, QMTO, and CMTO variational 
methods to the valence and conduction-band structure of GaAs using a minimal 
Ga spd As sp basis, and to the conduction band of GaCu02 using only one 
orbital, all others being removed by massive downfolding [24] . We also give simple 
expressions for the charge density and show the total energy as a function of 
volume for the various crystalline phases of Si calculated with the full-charge, 
differential LMTO method [47-49]. Finally, numerical results are presented for 
the error of the valence-band energy of diamond-structured Si - as a function of 
the potential overlap - obtained from LMTOs constructed for a potential whose 
MT-wells are centered exclusively on the atoms. In addition, results of a scheme 
which corrects for the error of 2nd order in the overlap will be presented [42]. 

In Sect. 4 we show that energy-dependent, linear transformations of the set 
of kinked partial waves - such as a normalization - merely leads to similarity 
transformations among the NMTO basis functions and, hence, does not change 
the Hilbert space spanned by the NMTO set. 

This is exploited in Sect. 5 to generate nearly orthonormal basis sets, (r) , 
for which the energy matrices defined in (19) become Hermitian, Hamiltonian 
matrices, We also show how to generate orthonormal sets, (r) , of 

general order, and we demonstrate by the example of the minimal MTO set 
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for GaAs that this technique works numerically efficiently - at least up to and 
including N = ‘6. This development of orthonormal basis sets should be impor- 
tant e.g. for the construction of correlated, multi-orbital Hamiltonians for real 
materials [23,54]. 

In the last Sect. 6 we show explicitly how - for = 1 and a condensed mesh 
- the general, nearly-orthonormal NMTO formalism reduces to the simple ASA 
formalism of the present Overview. 

In the Appendix we have derived those parts of the classical formalism for 
polynomial approximation - Lagrange, Newton, and Hermite interpolation - 
needed for the development of the NMTO method for discrete meshes [53]. 

2 Kinked Partial Waves 

In this section we shall define Oth-ordcr energy-dependent MTOs and show that 
linear combinations can be formed which solve Schrodinger’s equation for the 
MT-potential used to construct the MTOs. The coefficients of these linear com- 
binations are the solutions of the (screened) KKR equations. By renormalization 
and truncation of the irregular parts of the screened MTOs inside appropriately 
defined screening spheres, these Oth-order energy-dependent MTOs become the 
kinked partial waves of the 3rd generation. 

If we continue the regular solution ipm (e, r) of the radial Schrodinger equa- 
tion (2) for the single potential well, vr (r) , smoothly outside that well, it be- 
comes: 

= rii {nr) - ji{Kr) cotrim {£) = , ioi r > sr, (22) 

in terms of the spherical Bessel and Neumann functions, ji (kt) and rii (kt) , 
which are regular respectively at the origin and at inhnity, and a phase shift 
defined by: 

n (ks) cHn |(p (e,r)| /cHnrj^ — cHn jn («;r)| /91nr|g 
^ j (ks) cHn (e, r)| /01nr|^ — clln |j («;r)| /cllnr|^ ' 

In the latter expression, we have dropped the subscripts. Note that we no longer 
distinguish between ’inside’ and ’outside’ kinetic energies, e — v (r) and = 
£ — Kntz, and that we have returned to the common practice of setting Vmtz = 0. 
If the energy is negative, rq {kt) denotes a spherical, exponentially decreasing 
Hankel function. Note also that - unlike in the ASA - the radial function is not 
truncated outside its MT-sphere, and is not normalized to unity inside. In fact, 
we shall meet three different normalizations throughout the bulk of this paper, 
and (22) is the first. 
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Si p unscreened kpw 




Fig. 2. Bare Si p MTO according to Eq.(23) 



2.1 Bare MTOs 

The bare, energy-dependent muffin-tin orbital (MTO) remains the one of the 
1st generation [52]: 

4>RL (e, r) = Yl (r) (e, r) + j/ (nr) cot t]ri (e)] 

= Yl (r) I r < SR 

' ' ( n/ (/tr) for r > sr 

= Yl (r) [<fRi (£, r) - (e, r) + m (/tr)] , (23) 

and is seen to have pure angular momentum and to be regular in all space. 
The reason for denoting this Oth-order MTO 4> (e, r) , rather than (e, r) , 

should become clear later. 

In Fig. 2 we show the radial part of this MTO for a Si p-orbital, a MT-sphere 
which is so large that it reaches 3/4 the distance to the next site in the diamond 
lattice, and an energy in the valence-band, which - in this case of a large MT- 
sphere - is slightly negative (see Fig. 11 in Ref. [20]). The full line shows the MTO 
as defined in (23), while the various broken lines show it ’the 3-fold way’: The 
radial Schrodinger equation for the potential v (r) is integrated outwards, from 
the origin to the MT radius, s, yielding the regular solution, ip (e,r) , shown by 
the dot-dashed curve. At s, the integration is continued with reversed direction 
and with the potential substituted by the flat potential, whose value is dehned 
as the zero of energy. This inwards integration results in the radial function ’seen 
from the outside of the atom’, p° (e, r) , shown by the dotted curve. The inwards 
integration is continued to the origin, where p° (e, r) joins the ’outgoing’ solution 
for the hat potential, that is the one which is regular at inhnity: n {nr) . The 
latter is the envelope function for the bare MTO. 





18 



O.K. Andersen et al. 



As usual, the envelope-function for the MTO centered at R may be expanded 
in spherical-harmonics about another site R' R): 

KUi (KrR)YL (ffl) = ^Ji' (KrR')YL' [vr') Br,l',rl (e) , 

L' 

where the expansion coefficients form the Hermitian KKR structure matrix: 

(K|R-R'|)r;,,^„ (24) 

i” 

as conventionally [41] defined, albeit in i?-space. The spherical harmonics are 
as defined by Condon and Shortley, rn" = m' — m, the summation runs over 
I” = \l' — l\ , \l' — l\ + 2, ..., I' + I, and is real, because Cll'L" = 

fYLif)Y£,{r)YL"{f)df. 

If for the on-site elements of B (e) , we define: Brl rl> (e) = 0, and use 
the notation: /l {£,yr) = fi {Krn)YL {yr) , as well as the vector-matrix nota- 
tion introduced in connection with (7), we may express the spherical-harmonics 
expansion of the bare envelope about any site symbolically as: 

KTi (e, r) = j (e, r) B (e) + nn (e, r) . (25) 

If we now form a linear combination, YIrl 4>RL (e, yr) crl, of energy-dependent 
MTOs (23), and require that it be a solution of Schrddinger’s equation, then the 
condition is that, inside any MT-sphere (R') and for any angular momentum 
{L') , the contributions from the tails should cancel the (ky) cot riRni (e)-term 
from their own MTO, (pwL' {^,yr'), thus leaving behind the term {£,y) , 
which is a solution by construction. This gives rise to the original KKR equations 
[41]: 



[Br'L',RL (o) + K cot rjRi (Si) Sr'rSl'l] CRL,i 
RL 

= Kr'L',rl (£j) CRL,i = 0, (26) 

RL 

which have non-zero solutions, CRR^i, for those energies, £i, where the determi- 
nant of the KKR matrix vanishes. 

With those equations satisfied, the wave function is 

OO I' 

<pRL (£i, Yr) CRL,i = '^ Yu ^R'V isi,rR’)YL' {YR')cR'L',i + (27) 

RL — — 

YY [fRi {£i,rR) - ip°Ri (£i,rfl)] Yl (yr) CRL,i 

R=/^R' L 

near site R'. Since according to (22) the function (p ~ ip° vanishes outside its 
own MT-sphere, the terms in the second line vanish for a non-overlapping MT- 
potential so that, in this case, (27) solves Schrddinger’s equation exactly. If 
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the potential from a neighboring site (i?) overlaps the central site {R'), then 
‘fRL — Prl tongues stick into the MT-sphere at R'. The radial part of such a 
tongue is | {sr - vr) vr (sr) prl {sr) , to lowest order in sr - tr, as may be 
seen from the radial Sehrodinger equation (2). Let us now operate on the smooth 
functioir 'I'i (r) = i'h) crl,i i of which (27) is the expansion around 

site R' , with 77 — as given by (4) to find the error: 

(77 - £i) (r) = 

Y,VR,{rR>) Y, Y [PRI (£i, tr) - ip°Ri (£i, Tr)] Yl (ffi) CRL,i (28) 

R' R^R' L 

^ pairs 
RR' 

This shows that the wave function (27) solves Schrodinger’s equation for the 
superposition of MT-wells to within an error, which is of second order in the 
potential overlap [21,20]. 

2.2 Screened MTOs 

Screening is the characteristic of 2nd-generation MTOs and was hrst discovered 
as the transformation (8) to a nearly- orthonormal representation, in which the 
Hamiltonian is of seeond order [55,56]. Shortly thereafter it was realized that 
there exists a whole set of screening transformations which may be used to make 
the orbitals short ranged, so that the structure matrix may be generated in real 
space. It was also realized that the screening transformation could be used to 
downfold inactive channels and, hence, to produce minimal basis sets [1,18,44]. 
These applications were all for the ASA with k^= 0. Only long time after [21], 
did it become clear that screening would work for positive energies as well, and 
at that time a fourth virtue of screening became clear, namely, that sceening 
the range of the orbitals, simultaneously reduces their energy dependence to the 
extent that the full energy dependence may be kept in the interstitial region, thus 
making the K^=0-part of the ASA superfluous. Most of this was shown in the last 
paper on the 2nd-generation formalism [21]. Nevertheless, this paper was unable 
to devise a generally useful recipe for choosing the energy-dependent screening 
constants, it failed to realize that screening allows the return to: K^=e, and for 
those reasons it missed the elegant energy-linearization of the MTOs achieved 
by the 3rd generation. 

The sereened envelopes of the 2nd-generation method are linear superpositi- 
ons, 

n“ (£, r) = n (£, r) 5“ (£) , (29) 

of the envelope functions, n {e, r) , with the property that the spherical-harmonics 
expansions of the set of screened envelopes be: 



(sk' -rm) + {sr - vr) Vr {sr) iT', (r) 



nn (e, r) S'" (s) = Kn°‘ {e, r) = {s, r) B°‘ (e) -|- nn (e, r) , 



(30) 
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which are (25) with the substitutions: 



ji (ftr) ^ (e, r) = ji {nr) - m [nr) tan amm (e) , (31) 



and: B (e) B°‘ (e) , which will be determined below. In contrast to its bare 

counterpart, a screened envelope does not have pure angular momentum, i.e., 
cannot be factorized as a radial function times a spherical harmonics, and it 
depends explicitly on its surroundings. The background phase shifts a (e) - which 
may even depend on m (see for instance Fig. 11) - specify the shapes of the 
screened envelopes. Whereas the bare envelopes are regular in all space - except 
at their own site where they diverge like Yim (f) - the screened envelopes 

diverge at any site where there is a finite background phase shift in at least one 
L-channel. 

Note that only in the Overview did we use ASA k^= 0- notation with Greek 
letters denoting screening constants and S“ the structure matrix. In the bulk 
of the present paper, we use Greek letters to denote background phase shifts, 
and and 5“ to denote respectively the structure matrix and the screening 
transformation. 

We now find the screened structure matrix and the transformation matrix by 
expanding also the bare envelope on the left hand side of (30) by means of (25). 
Comparisons of the coefficients to KU^r {s, r^j/) and j^i (e, r^/) yield respectively: 

(£) = 1 _ (£) ^ and: (e) = B (e) 5“ (e) (32) 

K 

with the quantities regarded as matrices, e.g. fv^^tana is considered a diagonal 
matrix with elements taiiauL Srr'Sll'. As a result of (32): 



(s)"^ 



B{e)-^ 



tan a (e) 

K 



(33) 



which shows that, like the bare structure matrix, also the screened one is Her- 
mitian. In contrast to the bare structure matrix, the screened one has non- 
vanishing on-site elements. For background phase shifts known to give a short- 
ranged (e) , the inversion of the matrix B (e) + Kcota (e) , implied by (33), 
may be performed in real space, although the bare structure matrix is long- 
ranged. Eq. (33) is the k^=s equivalent of the ASA ’Dyson equation’ (10). 

For the inactive channels {RL = /) , wc choose the background phase shifts 
to be equal to the real phase shifts: 



ai (e) = rji (e) 



(34) 



so that for these channels, 

if (£, r) = ji {nr) - nj (/tr) tan r?/ (e) = -p} (e, r) tan r]i (e) . 

That is, we shape the set of screened envelope functions in such a way that, for 
the inactive channels, the radial functions, <p°j (e, r) , may be substituted smoothly 
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by the regular solutions, ipi (e, r) , of the radial Schrodinger equation. This is 
what we call downfolding. This substitution makes the screened envelopes be- 
come the so-called screened spherical waves, tp, of the 3rd-generation method. 
Only the screened spherical waves corresponding to the remaining, so-called ac- 
tive channels [RL = A) will be used to construct the MTO; they are: 

^’RLi^AR) = + (35) 

- ‘fii (e) , 

hi 

I 

which - in contrast to (e, r^) - are regular in all inactive channels, albeit 
irregular in the active channels. In (35), I = R'L'. Below, we shall choose to 
truncate the active channels inside their screening spheres. Due to the augmen- 
tation (substitution), the screened spherical waves do not transform linearly like 
(29). 

For the partial waves of high I, the phase shifts vanish due to the dominance 
of the centrifugal term over the potential term in the radial Schrodinger equation 
(2). As a consequence, the matrices involved in the Dyson equation (33) - whose 
indices run over all active as well as inactive channels - truncate above a certain 
I of about 3-4. 

Before specifying our choice of background phase shifts for the active chan- 
nels, let us define the energy-dependent, screened MTO analogous to the third 
equation (23) as the (augmented) envelope function, plus a term proportional 
to the function (p — ip° , which vanishes (quadratically) outside the central MT- 
sphere and has pure angular-momentum character. That is: 

^RL (e> !■«) = Yl (f fi) [pm (s, rn) - [e, tr)] (e, r^^) 

tan rjj^^ j 

= Yl (f fi) [pfii (s, vr) - p°Rf (s, vr)] + -ipf^L (£> ^r) (36) 

and RL e A. Here, the coefficient to p ~ p° has been chosen in such a way 
that, in its own channel and outside any other MT-sphere, the screened MTO is 
p°‘ + cot plus a term from the diagonal element of the screened structure 
matrix. 

To check this, we project onto the ’eigen-channel,’ making use of (35), (30), 
(22), and (31), and neglecting any contribution from pj {e,rR'Ys from overlap- 
ping neighboring MT-spheres: 



VrlYrl (^Ar) 



^Ri (e> rR) - p°r 7 (s, vr) + VrlYrl (e> ^r) 



\p — n + (j“ + n tan a) cot 77] 



tan?7 
tan r/“ 



+ j“cot77“ - n 



tan rj — tan a 
tan 77“ 



-i-n + f 



K 



<p“ + cot rf + 



ROC 

K 



(37) 
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For simplicity, we have dropped all arguments and indices in the last three lines. 
We see that the new phase shift, ?/“, is given by: 

tan rygi (e) = tan rjm (e) - tan aRL (e) , (38) 

as expected for the phase shift on the background of a. This is the same trans- 
formation as the one obtained from (33) for — (e) ^ . The definition of the 
renormalized free radial solution given in (36) may be written as: 

^RL (£, r) = m [rt) - jgi (£, r) cot (e) (39) 

= [ni {kt) ta.TLrjRi (e) - ji {nr)] cot (e) , 

and (e, tr) is the solution of the radial Schrodinger equation, normalized in 
such a way that it matches onto (fi°R% (e, r) at the MT radius, sr. The definition 
(39) reduces to (22) when a = 0. 

The set of screened MTOs now consists of the screened MTOs (36) of all 
active channels. Since the function has pure angular-momentum character, 

the mixed character of the screened MTO stems solely from the i/^-function. The 
result of projecting the screened MTO onto an active channel R' L' different from 
its own is seen from (30) to be: 

'Pr’L”^rl (e, Tfl) = Vr'L'iPrl (e, ^r) = J%l' («^: , (40) 

Av 

when tr! is so small that r lies inside only one MT-sphere, the one centered at 
R' . From (40) and (37) it is then obvious that, in order to get a smooth linear 
combination of screened MTOs, all j°^ -functions must cancel. 

This leads to the condition that the energy must be such that the coefficients 
can satisfy 

'^[B%A{si) + cot 5 A’ A\c%i = XI (o) = 0, (41) 

A A 

for all active R' L' = A' . These arc the screened KKR equations, and (e) is 
the screened KKR matrix. If these equations are satisfied, the linear combination 
of screened MTOs is: 

OO 

X (o, rij) cXi = X Z] 'f’R'L' (o> rm) Yl> {rR,)c%R,^i 4- (42) 

A I'—O m' 

X Z] ^ ''■«)] (^r) ^RR4 

L 

near site R! . As long as the MT-spheres do not overlap, this is a solution of 
Schrodinger’s equation for the MT-potential and, if the potentials overlap, then 
the ip — ip° tongues from the neighboring sites in the second line of (42) make 
the wave function correct to first order in the overlap [20]. This is exactly as 
in (27). The summation over spherical-harmonics around the central site inclu- 
des the contributions —pi (e, tr/) k~^ tan 77 / (e) Bf (e) c[| ^ provided by the 
screened-spherical-wave part of the MTO (see (36) and (35)). 
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Although energy-dependent MTO sets with different screenings are not li- 
nearly related, they all solve Schrodingcr’s equation for the MT-potential used 
for their construction via the corresponding KKR equation. E.g. had one chosen 
a representation in which a channel making a significant contribution to a wave 
function 'I'i (r) with energy Si = e were downfolded, then the corresponding so- 
lution of the KKR equation (41) would arise from (e) being long ranged and, 
as a function of £, going through a zero-pole pair near £*. If the energy were now 
fixed at some energy and the energy-independent set 0“ (£iy,r) were used as 
the Oth-order MTO basis in a variational calculation, then a useful result could 
in principle be obtained, but only if were chosen very close to £j. 



Si p kinked partial wave 




Fig. 3. Si Pill member of a screened spd-set of Oth-order MTOs (see text and 
Eqs.(36),(44)-(47)). 



2.3 Hard-Sphere Interpretation and Redefinitions 

We now wish to choose the background phase shifts for the active channels in 
a way which reduces the spatial range and the energy dependence of the MTO 
envelopes. It is obvious, that for the orbitals to be localized, they must have 
energies below the bottom of the continuum of the background - defined as the 
system which has the same structure as the real system, but has all phase shifts 
equal to those of the background. Hence, the active a(£)’s should be defined 
in such a way that the energy band defined by: |R° (£) + Kcota(£)| = 0, lie as 
high as possible. 

The discovery of a useful way of determining this background, turned out to 
be the unplamred birth of the 3rd MTO generation [19,20]. Realizing that the 
weakest point of the ASA was its solution of Poisson’s - and not Schrodinger’s - 
equation, and unhappy with the complexities of existing full-potential schemes, 
we [57] were looking for those linear combinations of Hankel functions - like (29) 
- which would ht the charge density continuously at spheres. With Mcthfessel’s 
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formulation [35] : What we wanted was those solutions of the wave equation which 
are Yl (f^j) at their own sphere and for their own angular momentum, and zero 
at all other spheres and for all other angular momenta. This set was therefore 
named unitary spherical waves. The solution to this boundary-value problem is 
of course a particular screening transformation (33). 

Our way of defining the background was thus in terms of hard screening- 
spheres for the active channels; the larger the screening spheres, the larger the 
excluded volume and the higher the bottom of the continuum. The screening 
spheres are not allowed to overlap - at least not if all 1-channels were active, 
because then a unitary spherical wave would be asked to take both values, 1 
and 0, on the circle common to the central and an overlapping sphere. As a 
consequence, in order to reduce the range and the energy dependence of the MTO 
envelope functions, the screening spheres should in general be nearly touching. 
Now, since the screening radii, , control the shapes of the envelopes, the relative 
sizes of the screening spheres should be determined by chemical considerations, 
i.e. the a’s may be covalent- or ionic radii in order that results obtained from an 
electronic-structure calculation be interpretable in terms of covalency, ionicity 
etc. Referring to the discussion in the Overview, one could say: The MT-spheres 
(s) are potential-spheres and the screening-spheres (a) are charge-spheres. 

Inspired by Ref. [21], practitioners of multiple-scattering theory - who tra- 
ditionally take the Kohn-Rostoker [41] Green-function point of view - found 
another useful way of determining the background phase shifts, namely in terms 
of repulsive potentials [58]. 

For a given active channel {RL = A), the radial positions, r = oa (e) , of 
the nodes of the background functions given by (31) are the solutions of the 
equation; 

0 = Ja (e, (e)) = Ji {n-aA (e)) - ni (kua (e))tanaA (e) . 

Whereas attractive potentials usually do not give positive radii - for an example, 
see the dotted curve in Fig. 2 - repulsive potentials do, as may be seen from the 
radial Schrodinger equation (2). For a hard repulsive potential, the position of the 
node is independent of energy and of 1. What we shall use for the active channels 
are therefore screening-sphere radii, oa, which are independent of energy and 
which usually depend little on L among the active channels. In terms of such a 
screening radius, the corresponding background phase shift is given by: 

tan q;a (e) = ji {naA) /ni (kua) . (43) 

Now, instead of having screened spherical waves (35) and MTOs (36) whose 
active channels are irregular at the origin - the irregularities of the inactive chan- 
nels were already gotten rid of by downfolding, followed by tp} (e, r) ipj (e, r) 
substitutions - we prefer that the active channels have merely kinks. This is 
achieved by truncating all active j“-functions inside their screening spheres, 
that is, we perform the substitution: 

J 0 for r < a A 

\ Ji (Kr) - ni {nr) ji (koa) /ni (koa) for r >ua ' 



Ja (e> r) -A 



(44) 
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which is continuous but not differentiable, for the screened spherical waves and 
for its own j "-function of the MTO - that is the second term on the last two lines 
of (37). With that substitution, a screened spherical wave, > vanishes 

inside all screening spheres of the active channels - except inside its own, where 
it equals n/ {ktr) Yl (?«) • This may be seen from (40) and the two first lines of 
(37). Finally, if we renormalize according to: 

i’RL (e, tr) = (e, r^) /rii (kohl) (45) 

- note the difference between the superscripts a and a - we finally arrive at the 
screened (unitary) spherical wave as defined in Refs. [19,20]. 

{e, rn) is that solution of the wave equation which is Yl (f a) on its own 
screening sphere, has vanishing Yl' (f^{/)-average on the screening spheres of the 
other active channels, and joins smoothly onto the regular solutions of the radial 
Schrodinger equations of the inactive channels. In those, the regular Schrodinger 
solutions are, in fact, substituted for the wave-equation solutions. 

It is now obvious, that overlap of screening spheres will cause complicated, 
and hence long-ranged spatial behavior of the screened spherical waves, and the 
worse, the more spherical harmonics are active. 

With the normalization (45), there is apparently no need for functions, like 
spherical Bessel and Neumann or Hankel functions, which have a branch-cut at 
zero energy, and this was the point of view taken in the first accounts [19,20] of 
the 3rd-generation method. However, the normalization (45) is not appropriate 
for a=0, and expressing the screened structure matrix in terms of the bare one 
(24) - which is the only one computable in terms of elementary functions - was 
slightly painful in Ref. [19]; moreover, in that paper downfolding was not presen- 
ted in its full generality. In these respects, the present, conventional derivation 
is simpler, but it takes more equations. 

With the a — > a redefinitions (44)-(45), the MTO remains as defined by 
(36), but with the screened spherical waves and its own j"-function truncated 
as described above. We may also renormalize the MTO like in (45): 

<pRL (e> !■«) = (p%L (e> !■«) /m {naRL) , (46) 

whereby these energy-dependent Oth-order MTOs become identical with the kin- 
ked partial waves of Refs. [19,20]. This normalization corresponds to: 



p°j,Ue,aRL)^l. (47) 

Note that this will cause the normalization of the radial Schrodinger-equation 
solution, (/3“ (e, r) , to depend on m in case the corresponding screening radius is 
chosen to do so. 

In Fig. 3 we show the screened counterpart of the bare Si p orbital in Fig. 
2. Since only the two first terms of (36) - but not the screened spherical wave 
- has pure angular momentum, we cannot plot just the radial wave function 
like in Fig. 2. Rather, we show the MTO together with its three parts along the 
[lll]-line between the central atom and one of its four nearest neighbors in the 
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diamond structure. The positions of the central and the nearest-neighbor atoms 
are indicated on the axis (Si), and so is the intersection with the central MT- 
sphere (s). The p orbital chosen is the one pointing along this [111] direction. 
The Si spd channels were taken as active, and to have one and the same screening 
radius, a = 0.75t, where t is half the nearest-neighbor distance, i.e., the touching- 
sphere radius. The places where the central and the nearest-neighbor screening 
spheres intersect the [lll]-line are indicated by a’ and ’a — >■’ with the arrow 
pointing towards the respective center. We see that the central MT-sphere is so 
large, that it overlaps the screening sphere of the neighboring atom. Like in Fig. 
2, the full curve shows the MTO (0“), and the dot-dashed {p°'Y), the dotted 
and the dashed (V’“) curves show the three terms in the renormalized 
version of equation (36). The dot-dashed and the dotted curves are identical with 
those in Fig. 2, except for the normalization; they arc the outwards-integrated 
solution {p°'Y) of the radial Schrodinger equation, continued by the inwards- 
integrated solution {p°°'Y) for the flat potential. These two curves have been 
deleted outside the central MT-sphere where their contribution to the MTO 
(36) cancels. The inwards integration ends at the screening sphere, inside which 
if°°' — with truncated - cancels its own-part, n; (nr) jni [na) , of the screened 
spherical wave, ijj, shown by the dashed curve (see Eqs. (37) and (44)). Neither 
of these cancelling parts are shown in the figure, and the dashed curve inside the 
central screening sphere therefore merely shows the contribution to the screened 
spherical wave from the inactive channels (? > 3). Due to the ^“-truncations, 
the screened spherical wave has kinks at all screening spheres and, inside these 
spheres, only the contribution from the inactive partial waves - which are regular 
solutions of the radial Schrodinger equations - remain. The full curve is the 
MTO, which is identical with the screened spherical wave outside its own MT- 
sphere. At its own screening sphere, its kink differs from that of the screened 
spherical wave due to the truncation of the ^“-contribution to Compared 
with the bare MTO in Fig. 2, the screened MTO in Fig. 3 is considerably more 
localized, even though a negative energy was chosen. 

If one demands that the valence band - as well as the lower part of the 
conduction band - of Si be described from first principles using merely the 
minimal 4 orbitals per atom, one cannot use a set with p orbitals such as those 
shown in Figs. 2 and 3; the d-MTOs must be folded into the envelopes of the 
remaining sp set by use of the appropriate structure matrix obtained from Eq. 
(33) with the choice (34) for the Si d-channels. The corresponding Si pm-MTO 
is shown in Fig. 4. Little is changed inside the central screening sphere, but the 
tail extending into the nearest-neighbor atom has attained a lot of d-character 
around that site, and the MTO is correspondingly more delocalized. 

The Si piii-MTO for use in an sp MTO basis constructed from the con- 
ventional Si+E potential - for which the diamond structure is packed bcc with 
equally large space-hlling spheres - is obtained by down-folding of the Si d and 
all empty-sphere channels. It turns out to be so similar to the one obtained from 
the fat Si-centered potential shown in Fig. 4, that we will not take the space to 
show it. 




Developing the MTO Formalism 



27 



Whereas the bare MTO in Fig. 2 is what has always been called a bare 
MTO, the screened ones in Figs. 3 and 4 look more like a partial wave, ^pY, with 
a tail attached at its own screening sphere - and with kinks at all screening 
spheres. Hence the name ’kinked partial wave’ given in Ref. [19]. In this original 
derivation, kinked partial waves with a = s < t were considered first, and only 
later, the limiting case a — ^ 0 gave rise to a painful exercise. The kinked partial 
waves have in common with Slater’s original Augmented Plane Waves (APWs) 
[59], that they are partial waves, (p{s,r)Y, of the proper energy inside non- 
overlapping spheres, which are joined continuously - but with kinks - to wave- 
equation solutions in the interstitial. In that region, the APW is a wave-equation 
solution with a given wave-vector, whereas the MTO is a solution with the 
same energy. Moreover, whereas the APW method uses identical potential and 
augmentation spheres, this is not the case for MTOs. 

If - for the third time in this section - we make a linear combination of MTOs 
- this time dehned with kinks - and demand that it solves Schrodinger’s equation, 
then the condition is, that the kinks - rather than the j“-functions - from the 
tails should cancel the ones in the head. This condition is of course equivalent 
with the one for j“-cancellation. Nevertheless, let us express the KKR equations 
in this language because it will turn out to have three further advantages: The 
artificial dependence on k = y/e and the associated change between Neumann 
and decaying Hankel functions will disappear, there will be a simple expression 
for the integral of the product of two MTOs, and we will be led to a contracted 
Green function of great use in the following section. 

Since the kinks arise because the j“-functions are truncated inside their 
screening spheres, the kink in a certain active channel of an MTO is propor- 
tional to the slope of the corresponding j“-function at a+. An expression for 
this slope is most easily found from the Wronskian, which in general is defined 
as: [/ (r) g' (r) — g (r) f (r)] = {/, g}^ , and is independent of r when the two 

functions considered are solutions of the same linear, second-order differential 



Si p kinked partial wave 




Fig. 4. Si pill member of a screened minimal sp-set of Oth-order MTOs (see text). 
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equation. As a consequence, = {n,j — ntana} = {n,j} = —k and 

therefore: 

dr (e, r) !dr\^^ = - [a^Kn (ko)] . (48) 



We now define the elements (e) - where R' L' and RL both refer to 

active channels - of a kink matrix [19,20] as times the kink in the R'L'- 
channel of (£, r^) . From the expression for 9j“ /dr \^^ , the last forms of the 
spherical-harmonics expansions (37) and (40), the dehnition (41) of the screened 
KKR matrix, and the renormalization (46), this is seen to be: 



K' 



R'L'ML 



(£) 



-K 



R'L',RL 



(e) 



nnr {RawL') Rni {naRL)' 



(49) 



Note that this is the kink matrix as defined in Ref. [20], whereas the one defined 
in Ref. [19] has the opposite sign. As presently defined, the energy derivative of 
the kink matrix is positive definite, as we shall se in the next section. 

Screening and the dehnition (49) have removed the spurious energy depen- 
dencies of (e) . To see this more clearly, let us use the hrst - rather than the 

last - forms of the spherical-harmonics expansions (37) and (40), which are also 
more closely related to the dehnition (36) of the MTO, and to Figs. 3 and 4: The 
kink matrix for is — [KUr {rra')] ^ (e) [kui [naA]] Moreover, 

i^A (^j ^r) contains the diverging term n {nr) jn {na) in its own channel, which 
in the MTO is being cancelled by a term from (see the third equation (37) 
and (38)). The kink matrix for the MTO set is now seen to equal the one for 
the set of screened spherical waves, plus - in the diagonal - the kink in the 
function (/?“ — (p° “ + n {nr) jn [na] . Since ^ is smooth, this kink is the one 
between the radial functions “ (e, r) and n («:r) jn (na) . We thus arrive at the 
expression: 



K°- 

-O R/ Tt 



R'L'ML 



r = - 



Br'L',RL (^) 

Knv {nawL’) Rni (naRL) 

+RRL [0 {ni (e, a)} - d {Vi (£> «)}] Sr'R^L'L 



(50) 



d 



— ^R'L' -^'Br!L'4’rl 



- 0^9 {(fj (£, a)} Sr'rSlil 



Br'L',Rl{^) - a,RLd{ipj ^R'R^L'L, 



(51) 



in terms of the logarithmic- derivative function at the screening sphere of the 
inwards-integrated radial function, 9{y:°(£,a)} = 91n (e, r)| /91nr|^. Re- 

member that RL and R' L' refer to active channels. 

In the third line of (50) we have pointed to the fact that the hrst, potential- 
independent part of the kink matrix is a\, times the outwards slope of the scre- 
ened spherical wave and in (51) we have denoted this slope matrix (e) . 

Note that, as presently dehned, this slope matrix is Hermitian and equals aRiRr 
times the non-Hermitian slope matrix dehned in Refs. [19,20]; moreover, the 
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transformation from to is not quite (49), but differs from it by the term 
adhi \ni («:r)| /91nr|^. Wc may switch from Neumann to Bessel functions, using 
again that ji (ko) = rii (ko) tana, and that {j, n} = 1/k. We get: 



(e) 



tana(e) ^ Kcota(e)] + ad {j [ko)} 



Kj ( ko ) 

[I? (e) + Kcot a (e)]^^ 



J i^a) 



j {kg) 



Kj {kg) 

ad{j{Ka)}, 



(52) 



where the last equation has been obtained with the help of (33), and where 
B (e) = (s) is the bare KKR structure matrix (24). The matrix B (e) + 

K cot a (e) is the bare KKR matrix for the background-potential and has dimen- 
sion (A + I) ; it only truncates when a/ (e) = rjj (e) = 0, as it happens for high 
1 . 



Computational Procedure. The recipe for a computation could be: Solve 
the radial Schrodinger equations outwards, and then inwards to a ~ 0.8t, for 
all channels up i < 3. Then, compute the Green matrix of the background, 
Qa=o (g) -t Kcota (e)] , by inversion in real space, choosing the 

strong screening just mentioned, i.e. nearly touching screening spheres for all 
spd (/) channels. This gives the strongly screened structure matrix, (e) or 
B°~ (e) , according to (52), and the KKR matrix, K°‘ (e) or (e) , for the real 
potential in the strongly screened representation according to (41) or (51). For 
a crystal, Bloch-sum the KKR matrix. Now, invert this matrix in real space to 
obtain the Green matrix, (e) = K°^ (s)^^ or G“ (e) = (e)^^ • Next, choose 

the physically and chemically motivated screening (/?) and rescreen the Green 
matrix to the downfolded representation, (e) or G^ (e) , using the scaling 
relations (53) or (55) derived below. As will be explained in the following Sect. 
3, this should be done for a number of energies. In addition, one will need the first 
energy derivatives G^ (e) . The latter may be obtained from (e) via numerical 
differentiation of the weakly energy dependent structure matrix, (s) , and 
calculation of (e, r) r'^dr — (£, r) r'^dr for the energy derivative of 

the logarithmic derivative function in (51), as will be shown in (61)-(63) below. 
With this (e) , compute (e) from (63) and, finally, rescreen to & (e) using 
the energy derivative of (55) given below. 

In order to evaluate the wave function (42), one needs in addition to B\,j^ (e) , 
the block B\j^ (e) , and this may be obtained from (52). 

The relation of the screening constants, the structure matrix, and the KKR 
matrix to those - see (10) and (11) - of the conventional ASA is simple, but not 
as straightforward as the a-to-a transformations of the present section, so for 
this topic we refer to Refs. [19,20]. 

This completes our exact transformation of the original KKR matrix (26) 
which has long range and strong energy dependence [B'^ (e, k) has poles at the 
free-electron parabola: |k -|- G| =£ ] to a screened and renormalized KKR 
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matrix which - depending on the screening - may be short ranged and weakly 
energy dependent. The kink matrix is expressed in terms of a slope matrix, 
which only depends on the energy and the structure of the background, and the 
logarithmic derivatives of the active radial functions extrapolated inwards to the 
appropriate screening radius. 



2.4 Re-screening the Green Matrix 



In the ASA, it is simpler to re-screen the Green matrix (15) than the structure 
matrix ( 10 ), because the former involves additions to the diagonal and energy- 
dependent rescaling of rows and columns, but no matrix inversions. The same 
holds for the fully energy-dependent matrices of the 3rd-generation, as may be 
seen from (33) or (52) for the structure matrix. For the Green matrix (41), we 
get with the help of (52) and a bit of algebra: 

G“ (e) = (e) ^ tana (e) [1 — tan a (e) cot rj (s)] 

+ [1 — tan a (e) cot r/ (e)] (e) [1 — tan a (e) cot r/ (e)] , 

which has the form (15). Solving for (e) and setting the result equal to 

G^ (e) yields the following relation for re-screening of the Green matrix: 

0 ^ tan? 7 ^ (g) . tan? 7 ^ (e) _ tana(g) - tan /3(g) tan??^ (g) 

tanr/“(g) ^^tanr/“(g) k tanr/“(g)' ^ ^ 

In a-language, where according to (49): G“ (g) = — nn {kg) (e) Kn (na) , the 

diagonal matrices in (53) become [n (nb) /n (/ta)] [tan r/^ (g) / tanr/“ (g)] and 
Kn (ko) n {nb) [tana (g) — tan/3 (g)] and may, in fact, be expressed more simply 
in terms of the inwards-integrated radial wave function, renormalized according 
to (47). In order to see this, we first use the form (39): 



V.°“(g,r) 



n (nr) tan rj^ (g) — j“ (g, r) 



n (ko) tanr/“ (g) ’ 

and then evaluate this at the screening-radius b : 

n {Kb) tan r/“ (g) — (g, b) n (Kb) tan rf (g) 



‘^°“(e,b) = 



n (kg) tan 77 “ (g) 



n (kg) tan 77 “ (g) 

To obtain this result, we have also used: 

j“ (g, b) = j (Kb) — n (Kb) tan a (g) = n (Kb) [tan /3 (g) — tan a (g)] , 

from (31) and (43). The second, readily computable function is that solution of 
the radial wave equation which vanishes at a with slope Xjd^ \ 



f (£,r) = 



(£,r) 



a^dj°‘ (g, r) /9r| 



= —Kn (kg) (g, r) . 



(54) 
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Evaluation at r = 6 yields: 

(e, 6) = —Kn {ko) (e, h) = nn (na) n (Kb) [tan a (e) — tan /? (e)] , 

which is the second function needed. Hence, we have found the following simple 
and practical scaling relation for re-screening of the Green matrix: 

G" (e) = “ (e, b) G“ (e) “ (e, b) + f (e, b) “ (e, 6) . (55) 

2.5 Green Functions, Matrix Elements, and Charge Density 

The kinked partial wave is the solution of the inhomogeneous Schrodinger equa- 
tion: 



{n - s) (p%L, (£, r) = - ^ d {tr - url) Yl {tr) XKL.fl'L' (s) > (56) 

RL 

provided that we define the MTO (36) the 3-fold way indicated in Figs. 2 - 
4, and therefore - for the MT-Hamiltonian % (4) - use the radial Schrodinger 
equation (2) channel- wise. 

The kinks of the MTO are given correctly by (56), but the proper MTO 
does not solve Schrodinger’s differential equation in the shells between the scre- 
ening and the MT-spheres; here we need the 3-fold way. This way must not be 
an approximation: For instance, when applied to those linear combinations of 
MTOs which solve the KKR equations - and hence Schrddinger’s equation - 
equation (56) is correct (and yields zero), because for each active channel. A! , 
the two solutions, Va' '^a (^> ^r) ^‘a (^> ^R') radial wave 

equation match in value and slope at or/ , and therefore cancel throughout 
the shell sr> — aR'R'. Expressed in another way: For energy-dependent MTOs, 
kink-cancellation leads to cancellation of the triple- valuedness. For the energy- 
independent NMTOs to be derived in the next section, special considerations 
will be necessary. 

Solving (56) for 6 (j'r - url) Yr (vr) , leads to: 

^ ^‘r'L' (^i !■) G%L',rl (^) = {’’’R ~ o-Rl) Yr (fi{) (57) 

R'L' 

which shows that the linear combinations 

Irl (Sj ^%L' {^i !■) Gr,l, RL (e) , (58) 

R'L' 

of MTOs - all with the same energy and screening - is a contraction of r' onto 
the screening spheres (r' — > rrr , RL) of the Green function defined by: 



(LLr — e) G (e; r, r') = — <5 (r — r') . 
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The contracted Green function 7 ^^ (c, r) has kink 1 in its own channel and kink 
0 in all other active channels RL) . This function is therefore a solution of 
the Schrddinger equation (dehned the 3-fold way) which is smooth everywhere 
except at its own screening sphere. 7 ^^ (e, r) is usually delocalized, and when 
the energy, e, coincides with a pole, £j, of the Green matrix, 7 ^^ (£;r) diverges 
everywhere in space. This means, that when e = Sj, then the renormalized 
function is smooth also at its own sphere, and it therefore solves Schrodinger’s 
equation. In vector-matrix notation, equations (56) and (57) become: 

(H -£)(/)“ (e, r) = (r) (£) , 

CH - £) 0“ (£, r) G“ (£) = {H-e) 7 “ (£, r) = (r) , 

where we have defined a set of spherical harmonics on the a-shells with the 
following members: 



^RL (^r) = ^ i'f'R ^ ^Rl) Yl {^r) ■ 



(59) 



If expressed in real space, our Green matrix, G“ (£) , is what in multiple- 
scattering theory [ 11 ] is usually called the scattering path operator and denoted 
T (e). In the 2nd-generation LMTO formalism, it was denoted g (e) , but in the 
present paper we denote matrices by capitals. 

Since in the 3-fold way, an MTO takes the value one at its own screening 
sphere and zero at all other screening spheres, expression (56) yields for the 
matrix element of ?{ — s with another, or the same, MTO in the set: 

(<^R'L' (s) 1”^ ^ <I>RL i^)) = ~^R'L',RL (^) = ~^R'L'ML i^) i (60) 



which says that the negative of the kink matrix is the Hamiltonian matrix, minus 
the energy, in the basis of energy-dependent Oth-order MTOs. 

For the overlap integral between screened spherical waves, with possibly dif- 
ferent energies and in the interstitial between the screening spheres, defined 
channel-by-channel, we obtain the simple expression [19]: 



{4’R'L' {e')\rRL {e)} 



^R'L',RL (^0 “ Bf^,L, RL (e) 



£' — £ 



Br'L'ml (s) if 



(61) 



by use of Green’s second theorem, together with expression (51) for the surface 
integrals. Note that, neither active channels different from the eigen-channels, 
R' L' and RL, nor the inactive channels contribute to the surface integrals. The 
reasons are that (£^^) and (SiT) vanish on all ’other’ screening sphe- 

res, and that they are regular in the inactive channels. The latter means that, 
in the inactive channels, the ’screening-sphere interstitial’ extends all the way 
to the sites (a/ ^ 0). For the overlap integral between kinked partial waves, the 
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3-fold way yields: 



I ^RL (s)) = ii’R'L' (^0 I V'fiL (s)) + ^R’rSl'L X 

O rsii rsji 

‘fRL (e'> r) (fRL (£> r) r^dr - / Lp°j{i (e', r) (e, r) r^dr 
0 clrl 

^R'L',RL (^0 ^ Kr'L'.RL {^) r>a ^ ^ -r / 

^ ^ Kr'L’,RL (s) if £ ^ 



(62) 



For the overlap matrix for the set of contracted Green functions, this gives; 



(7“(e')l7“(£)) = 






e' — e 

-G“(£) = G“ (e) F:“ (e) G“ (e) 



if e' — > e. 



(63) 



We see that 5“ (e) , (e) , and G“ (e) are Hermitian, just like (e) , (e) , 

and G°'(e). Whereas (e) and (e) are positive definite matrices, that is, 
their eigenvalues are positive or zero, G“ (e) is negative definite. For well-screened 
MTOs, the logarithmic derivative functions in the diagonal of the kink matrix 
(51) depend more strongly on energy than the slope matrix. The way to com- 
pute the energy derivative (e) is therefore to compute i?“ (e) by numerical 
differentiation, and the remaining terms by integration as in (62). 

In the following we shall stay with the normalization (45)-(47) denoted by 
Latin - rather than Greek - superscripts and shall rarely change the screening. 
We therefore usually drop the superscript a altogether. Some well-screened re- 
presentation is usually what we have in mind, but also heavily down-folded - 
and therefore long-ranged - representations will be considered. In those cases, 
some parts of the computation must of course be performed in the Bloch - or 
k-spacc - representation. 

The wave function is (r) = <p{ei,r)ci, where the eigen(column)vector 
Ci solves the KKR equations, K {si) Ci = 0, and is normalized according to: 
1 = c|FT (si) Ci, in order that | >Fi) = 1. From the definition (36) of the MTO, 
we see that an accurate approximation for the charge density, which is consistent 
with the 3-fold way and, hence, with the normalization, has the simple form: 



P (r) = P* (r) + \pr {vr) - (rfl)] (64) 

R 



where the global contribution is: 

RR' LL' 




i^RL (£,yr) Frl,r'L' (e) •’pR'L' (e,rR,)* de (65) 
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and the local contributions, (r^) — p'^ (r^) , which vanish smoothly at their 
respective MT-sphere, are given by: 

Pr (’') = X] ^RL,RL' {e) VRi’ (£, r) de 

Pr (’') = X] (f ) [ ^°RI (£> r) rRL,RL' (£) ^Ri' {£, r) de . (66) 

The common deiisity-of-states matrix in these equations is: 

occ ^ 

^RL,R'L' (e) = CRL,i^ - ^i)cR'L' i = —^"^GrL^R'U • (67) 

A — / ’ 7J- 

i 

The approximations inherent in (64) are that all cross-terms between products 
of xp-, (f-, and (/^“-functions, and between if- or (/^“-functions on different sites are 
neglected. 

3 Polynomial MTO Approximations 

In this section we shall show how energy-mdependent basis sets may be derived 
from the kinked partial waves, that is, how wc get rid of the energy dependence 
of the MTOs. Specifically, we shall preview the generalization [51,24] of the 
3rd-generation LMTO method [19,20] mentioned in connection with Fig. 1. This 
generalization is to an ’N’MTO method in which the basis set consists of energy- 
independent NMTOs, 



N 

^RL (’') = ^ ^R'L' ^R'L\RL-n > 

n— 0 R' L' 

N 

where ^^r uml-u = ^R'R^l’l, 

n—0 

constructed as linear combinations of the kinked partial waves at a mesh of 
+ 1 energies, in such a way that the NMTO basis can describe the solutions, 
'Pi (r) , of Schrodinger’s equation correctly to within an error proportional to 
(£i — £o) (si “ £i) ••• (si “ ^n) ■ Note the difference between one-electron energies 
denoted £i and £j, and mesh points denoted £„ and Em, with n and m taking 
integer values. The set, (r) , is therefore simply 0(£o,r) , and this is the 

reason why, right at the beginning of the previous section, (p {e, r) was named the 
set of Oth-order energy-dependent MTOs. For iV > 0, the NMTOs are smooth 
and their triple- valuedness decreases with increasing N. For the mesh condensing 
to one energy, £,^, the NMTO basis is of course constructed as linear combinations 
of 4>{e^,r) and its hrst N energy derivatives at e,y. For N=l, this is the well- 
known LMTO set. 
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The immediate practical use of this new development is to widen and sharpen 
the energy window inside which the method gives good wave functions, without 
increasing the size of the basis set. One may even decrease the size of the basis 
through downfolding, and still maintain an acceptable energy window by increa- 
sing the order of the basis set. The prize for increasing N is: More computation 
and increased range of the basis functions. 



3.1 Energy-Independent NMTOs 

What we have done in the previous sections - one might say - is to factorize 
out of the contracted Green function, 7 (e, r) , some spatial functions, <f>RL (&, r) , 
which are so localized that, for two energies inside the energy-window of interest, 
the corresponding functions, (puL (£>r) and fuL (£^^) : cannot he orthogonal. In 
other words: The kinked partial waves are so well separated through localization 
and angular symmetry that we need only one radial quantum number for each 
function. 

Now, we want to get rid of the kinks and to reduee the triple-valuedness and 
the energy dependence of eaeh kinked partial wave - retaining its i?L-character 
- to a point where the triple-valuedness and the cncrgy-dcpcndcncc may both 
be neglected. This we do, first by passing from the set (f> {s, r) to a set of so- 
called Nth-order energy-dependent MTOs, (£)!■) > whose contracted Green 
function. 



N 

(e,r)G(e) = (f){e,r)G{e) - ^ (e„, r) G (£„) (e) , (69) 

n— 0 

differs from (j) (e, r) G (e) by a function which remains in the Hilbert space span- 
ned by the set with energies inside the window of interest, and which 

is analytical in energy. The two contracted Green functions thus have the same 
poles, and both energy-dependent basis sets, (p{e,r) and ! can there- 

fore yield the exact Schrodinger-equation solutions. The analytical functions of 
energy we wish to determine in such a way that (e, r) takes the same value, 
x(-^) (r) , at the N + 1 points, £q, ...,£at. With the set (Sj!") dehned that 
way, wo can finally negleet its energy dependence, and the resulting x*'^^ (i") is 
then the set of Nth-order energy-mdependent MTOs. 

Other choices for the analytical functions of energy, involving for instance 
complex energies or Chebyshev polynomials, await their exploration. 

One solution with the property that x^rl takes the same value for £ at 

any of the N + 1 mesh points, is of course given by the polynomial: 



aW 

^n-,R'L',RL 



(£) 



N 



dR'RSh'L 



S 6n 



m—0,^n 



of Nth degree. But this solution is useless, because it yields: x^^^ (i") = 0- If) 
instead, we try a polynomial of (N — l)st degree for the analytical function, then 
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we can write down the corresponding expression for the set (j") without 
explicitly solving for the {N + 1)^ matrices (Sm) , and then prove afterwards 
that each basis function has its triple-valuedness reduced consistently with the 
remaining error {si — Sq) {si — ei) ... {si — Em) of the set. 

Since we want to be independent of n for 0 < n < N, all its 

divided differences on the mesh - up to and including the divided difference 
of order N — vanish, with the exception of the 0th divided difference, which is 
yXN) ^ consequence, the A^th divided difference of x^’^^ r) G (e) on the 

left-hand side of (69) is x^^^ (j") times the A^th divided difference of the Green 
matrix. Now, the A^th divided difference of the last term on the right-hand side 
vanishes, because it is a polynomial of order IV — 1, and as a consequence. 



(r) 



A^(j)[T)G f A^G \ 
Z\[0...AT] iv^[0...AT]y 



(70) 



This basically solves the problem of finding the energy-independent NMTOs! 
What remains, is to factorize the divided difference of the product (j) (s, r) G (e) 
into spatial functions, (f> (£„, r) , which are vectors in RL, and matrices, G (e„) , 
with ri = 0, ..., N. Equivalently, we could use a binomial divided-difference series 
in terms of (f> (sq, r) and its first N divided differences on the mesh together with 
G{en) and its corresponding divided differences. 

For a condensed energy mesh, defined by: e„ — )• for 0 < n < N, the A^th 

divided difference becomes times the A^th derivative: 



A^f 

Zi[0...Af] 



f[0-N] 



±_ d^f je) 
m de^ ’ 



(71) 



but since a discrete mesh with arbitrarily spaced points is much more powerful 
in the present case where the time-consuming part of the computation is the 
evaluation of the Green matrix (and its first energy derivative for use in Eq. 
(63)) at the energy points, we shall proceed using the language appropriate for 
a discrete mesh. In (71) we have introduced the form / [O...A^] because it may - 
more easily than A^ f /A [O...Af] - be modified to include another kind of divided 
differences, the so-called Hermite divided differences, which we shall meet later. 

Readers interested in the details of the discrete formalism are referred to the 
Appendix where we review relevant parts of the classical theory of polynomial 
approximation, and derive formulae indispensable for the NMTO formalism for 
discrete meshes. Readers merely interested in an overview, may be satisfied with 
the formalism as applied to a condensed mesh and for this, they merely need 
the translation (71) together with the divided-difference form of the NMTO to 
be described in the following. Details about the Lagrange form may bo ignored. 



Lagrange form. We first use the Lagrange form (149) of the divided difference 
to factorize the energy-independent NMTO (70) and obtain: 



N 



X*-^^ (r 






<pn (r) 



n=0 Y[m=0.^n 



G[0..A^]"^ , 



(72) 
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Si p Imto 




Fig. 5. Si pill member of the spd-set of 0th (dottet) and Ith-order MTOs (see text 
and Eq.(74)). 



Here and in the following, (r) = (p (e„, r) and G„ = G (e„) . Eq. (72) has the 
form (68) and we see, that the weight with which the MTO set at £„ enters the 
NMTO set, is: 



= 



Gr, 



Ylm=0,^n 



G[0..fV]^ 



(73) 



By application of (149) to the Green matrix, we may verify that these Lagrange 
weights sum up to the unit matrix. For this reason, the RL characters of the 
NMTO basis functions will correspond to those of the kinked partial waves. 

As an example, for A^=l we get the so-called chord-LMTO: 

(r) = Po (r) Go (Go - Gi)-' + (r) Gi (Gi - Go)“' 

= Po (r) (i^i - Ko)-^ + 01 (r) {K^ - Ki)~^ Ko (74) 

= 00 (r)-0([Ol],r)X[Ol]“'Ao 
— >■ 0 (r) — 0 (r) k~^K. 



In this case, there is only one energy difference, £o — £i, so it cancels out. In the 
3rd line, we have reordered the terms in such a way that the Newton form, to 
be derived for general N in (88) and (90) below, is obtained. In the 4th line, 
we have condensed the mesh onto whereby the well-known tangent-LMTO 
[19,20] is obtained. The latter is shown by the full curve in Fig. 5 for the case of 
the Si Pill-orbital belonging to an sp set. The dashed curve is the corresponding 
kinked partial wave, p (r) , shown by the full curve in Fig. 4. Compared to the 
latter, (r) is smooth, but has longer range. The strong contributions to the 
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tail of the LMTO from cj) (r)’s on the nearest neighbor are evident. It is also clear, 
that for computations involving wave functions - e.g. of the charge density - the 
building blocks will rarely be the XMTOs, but the kinked partial waves, cpn (r) , 
which are more compact. 

One might fear that the discrete NMTO scheme would fail when one of the 
mesh points is close to a one-electron energy, that is, to a pole of the Green 
matrix, but that does not happen: If one of the G„’s diverges, this just means 
that the corresponding Lagrange weight is 1, and the others 0. Hence, in this 
case the NMTO is just <pn (r) , and this is the correct result. Moreover, the 
kink of this single cf>n (r) does not matter, because in this case where G (e) is 
at a pole, the determinant of its inverse vanishes, so that the kink-cancellation 
equations, KnCn = 0, have a non-zero solution, c„, which yields a smooth linear 
combination, (f>n (r) c„, of NMTOs. 

Kinks and triple- valuedness. The energy-independent NMTOs have been de- 
fined through (69) and (70) in such a way that (£) r) (r) oc (e — £q) ... 

(e — Sn)- We now show, that also the kink-and-triple-valuedness of (r) is 
of that order, and therefore negligible. 

The result of projecting the energy-dependent MTO onto Yl> {^r') for an 
active channel was given in (37) for its own channel, and in (40) for any other 
active channel. Together, these results may be expressed as: 

'Pr'lxPrl (s, rn) = {e, rn) 6 r>r5l'l + iwL' (£, ^r') x 

[«:cot {e) Sr'rSrir + (e)] 

or, in terms of the renormalized functions (44), (46), (47), and (54), as well as 
the kink matrix defined in (49), as: 

'Pr'l"Prl (S: ^r) = ‘fm (£> ^r) 5r'r5l'l + jwL’ ^R') (^) ■ 

Here, like in (37) and (40), contributions from MT-overlaps - which are irrele- 
vant for the present discussion - have been neglected. Without kinks and triple- 
valuedness, Vri L'4’rl rfl) would be given by the first term, and the kinks and 

the triple-valuedness are therefore given by the second term: 

T~R'L'<Prl (£> ^r) = Jr'L' {^^'’'R') ^R'L'ML (s) • C^5) 

This vanishes for those linear combinations of MTOs which solve the kink- 
cancellation conditions. 

What now happens for the energy-independent approximation, x*'^^ (r) = 
(r) , to the Oth-order energy-dependent MTO, x^°^ {s, r) = ()) (e, r) , is that the 
former has kinks and triple-valuedness, but both are proportional to K (eo) which 
- according to (56) - is proportional to 71— Sq and, hence, to £o- The kinks and 
triple-valuedness are thus of the same order as the error of x*-°^ (r) ■ Similarly, for 
> 0, the fact that the (£)’s are polynomials of [N — l)st degree, reduces 
the triple-valuedness of x*'^^ fo being proportional to (e — £q )...(£ — £at) , 
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as we shall now see: Multiplication of (75) with G“ (e) from the right yields: 
T 4>°‘ (e,!") G“ (e) = (e,r) , and for the kinks and the triple- valuediicss of the 

contracted Green function (69) we therefore get: 

N 

(£, r) G (£) = f (£, r) - ^ f (e„, r) 4^) (e) . 

n— 0 

Taking again the A^th divided difference for the mesh on which (e, r) is 
constant yields: 

T-^W(r) = j-([o...iV],r)G“[0...iV]-' (76) 



= ([0...7V] ,r) - £o) - £i) - 

for the kinks and the triple-valuedness of the energy-independent NMTO. In 
the last line, we have used an expression - which will be proved in (83) - for 
the inverse of the Al^th divided difference of the Green matrix in terms of the 
product of energy matrices to be defined in (81). At present, it sufhces to note 
that differentiatioir of the Green function, 

(77) 



for a model with one, normalized orbital yields: 



' 1 d^G (£) 


■ 


-1 




IV! de^ 


Si, 




y (£,-£.)^+^_ 



(ffi 



£.7 



A7+1 



where the last approximation holds when the mesh is closer to the one-electron 
energy of interest, Si, than to any other one-electron energy, Sj ^ £j. Note that j 
- and not n - denotes the radial quantum number. Similarly, this model Green 
function has a divided difference on a discrete mesh of A^+1 points, whose inverse 
is: 



G[o..iv; 




£n) 



N 



(£-7 £n) 1 

n— 0 



(78) 



as proved in Eq. (159) of the Appendix. We have thus seen that the triple- 
valuedness is of the same order as the error present in (i") due to the neglect 
of the energy-dependence of x^^^ (£j J") ■ 

The radial function j“ (£, r) in (75) vanishes for r < a, where it has a kink of 
value 1/a^, and it solves the radial wave equation for r > a. As shown in [51], 
its expansion in powers of r — a > 0 is: 



rf{e,r) = ''—^ + ^[l{l + l)~sa‘^] 



^ [l81(? + l) + (i(Z + l)-£a2)^] 



r — l{l + l) ( r ~ a'' 

3!~ 



21 r — a 
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This means the A^th divided-difference function entering (76) satisfies: 
f{[0...N],r) cc 

The kink and triple-valuedness (76) in the s — a shell of (r) is thus pro- 
portional to (r — ~ ^") , and for this reason the energy-window 

widens as ,s — a decreases, that is, as the screening increases. 



Transfer matrices and correspondence with Lagrange interpolation. 

We need to work out the effect of the Hamiltonian on the NMTO set. Since the 
NMTOs with > 0 are smooth, the contributions from the delta-function on 
the right-hand side of (57) for the contracted Green function will cancel in the 
end. Operation on (69) therefore yields: 

U ^(j) (£, r) - x^^^ (e, r)] G (e) = 0 (e, r) eG (e) - (e, r) G (e) 

= n £nG„Al^) (£) 

^ — ^n—0 

and by taking the A^th divided difference for the mesh on which x*"^^ 
constant, we obtain: 

H7([0...Af],r) = nx^^H^)G[Q...N] = (<^£G) ([O...Af] , r) 

= 7([0..A^- 1] ,r) -t- £at 7([0...A^] ,r) , (79) 

using (151) with the choice of the last point on the mesh. Solving for the NMTOs 
yields: 






(80) 



where = 7 ~ 1] ? G[0..A^ — 1] ^ is the energy-independent 

MTO of order TV — 1, obtained by not using the last point. Moreover, 



~ SN + G[0..N -1]G[0...N]~^ ={sG)[0...N]G[0...Ny'^ 

^n.Gri, 



N 



N 



= E 



;To rim=0,#n 



G[0...iV]-' = ^£„4^\ 



(81) 



is the energy matrix which - in contrast to (r) - is independent of which 

point on the mesh is omitted. The first equation (81) shows how to compute 
and the last equation shows that is the energy weighted on the O...Af- 
mesh by the Lagrange matrices (73). For a condensed mesh, the results is the 
simple one (19) quoted in the Overview. 

We now consider a sequence of energy meshes, starting with the single- 
point mesh, £q, then adding e\ in order to obtain the two-point mesh £q, £i, 
then adding £2 obtaining the three-point mesh £q, £ 1 , £ 2 , a.s.o. Associated with 
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these meshes we obtain a sequence of NMTO sets: the kinked-partial wave set, 
(r) ,thc LMTO set, (r) ,thc QMTO set, (r) > a.s.o. Working down- 
wards, we thus always delete the point with the highest index. Equation (80) now 
shows that % — en may be viewed as the step-down operator and — Eat as 
the corresponding transfer matrix with respect to the order of the NMTO set. 

In this sequence we may include the case N=0, provided that we define: 

— £o = —K (eo) and (i") = (i") ■ (82) 

+ 1 successive step-down operations on the NMTO set thus yield: 

[U - Eo) ... {U - Eat) x^^^ (r) = ^ (r) ~ eq) ... - en^ 

which, first of all, tells us that one has to operate N times with - that is, 
with - before getting to the non-smoothness of an NMTO. This is consistent 
with the conclusion about kinks and triple-valuedness reached in the preceding 
sub-section. Secondly, it tells us that the higher the N , the more spread out the 
NMTOs; if we lot r (M) denote the range of the £l*^'^^-matrix, then the range of 
the NMTO is roughly X^m=o ^ i^) ■ 

The product of — Eq and all the transfer matrices on the right-hand side 
of the above equation is seen from (81) and (82) to be simply: — G[0...A^] ^ . 
Hence, we have found the matrix equivalent of the elementary relation (78): 

-G [0...1V]“^ = - Eo) - ei) ... - en) . (83) 

The other way around: Recursive use of (83) with increasing N , will generate the 

transfer matrices and will lead to the first equation (81). Note that although the 
order of the arguments in the divided difference on the left-hand side is irrelevant, 
the order of the factors on the right-hand side is not, since the transfer matrices 
do not commute. That G [O...A^] is Hermitian, is not so obvious from (83) either. 
Finally, we may note that G[0..n— l,n+ l..A^] is not defined by (83) but by 
(148): 

G[0..n- l,n+ 1..1V] = G[0...fV-l] -f (e^ - e„) G [O....A^] . 



Relation (83) now gives the following form for the Lagrange weights (73): 






^ N -1 - Eq) .. ( E (") - £„) .. (^(^) - en) 

^ ij^n '^o) •• (^n l) (^n ^n+l) •• (^n 



(84) 



and this is seen to pass over to the classical expression (146) for the Lagrange 
coefficients if we substitute all energy matrices by the energy: e. This 

correspondence between - on the one side - the set 4> (e, r) and the Lagrange 
polynomial approximation (146) to its energy dependence (Fig. 1) and - on the 
other side - the set (r) expressed by (68) with the matrix form (84), is 
conceptually very pleasing. What is not so obvious - but comforting - is that 
the Hilbert space spanned by the NMTO set is invariant under energy-dependent 
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linear transformations, </> (e, r) = (j) (e, r) T (e) , of the kinked partial waves. This 
will be shown in a later section. 

By taking matrix elements of (80), the transfer matrix may be expressed as: 

£<"> - « = (y»> I (y"> m - e„i y»>) . (ss) 

This holds also for A^=0, provided that we take the value of (r) at its scre- 
ening sphere to be Lp° (s, a) = 1 - as dictated by the 3-fold way - so that 
I = 1- The form (85) shows that the transfer matrices with > 1 

are not Hermitian, but short ranged, as one may realize by recursion starting 
from N=0. Finally, it should be remembered that the NMTOs considered sofar 
have particular normalizations, which are not: 1 = 1) a-nd so do the 

transfer matrices. We shall return to this point. 



Newton form. Instead of using the Lagrange form (149) to factorize the NMTO 
(70), we may use the divided-difference expression (150). With the substitutions: 
f (e) ^ G {e) and g {e) <p {e, r) , we obtain the Newton form for the NMTO 
which most clearly exhibits the step-down property (80): 



yW (r) = {[M..N] , r) G [0..M] G [0...iVr' 

= <t^N{r)+ct>{[N- 1, IV] , r) - En) + ■■ 

.. + 0 ([0...N] , r) .. (^e^n) ^ j ^ 



( 86 ) 



since, from (56) and (79), 

(TI-£Ar)07v(r) = -5n,oS{'<^)Ko, 

{n - Sn) 4> {[M...N] ,r) = (!> {[M..N - 1] , r) . (87) 

We thus realize that the energy matrices in the Newton series for the NMTO 
set arc the matrices for stepping down to the sets of lower order. For some 
purposes, the ’reversed’ series, obtained from (150) with /(e) /(e,r)G(e) 

and <7 (e) ^ G (e): 

^(N) (r) = <P ([0..M] , r) G [M..N] G [0...N]-' 

= /o (r) + / ([01] , r) - £o j + .. (88) 

.. + ([0...1V] , r) (^(1) - £^_i) .. (sW - £o) , 



is more convenient. This expression clearly exhibits the correspondence with the 
Newton polynomial approximation (147) to the energy dependence of </(e, r) . 
Conceptually, a divided-difference series is more desirable than the Lagrange 
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series, because the Lagrange weights (84) ’fluctuate wildly’ as a function of n, 
taken in the order of inonotonically increasing energies. 

For a condensed mesh, (86) and (88) obviously reduce to one-and-the-same 
matrix-equivalent of the Taylor series for cp (e, r) : 

^(N) ^ + (p (r) -I- .. 

^ (r) - e^) •• , 

and (87) becomes: 

{N-M) (N-M-1) 

(H - ^ (r) = (r) K. (U - 

Readers used to the LMTO-ASA method, where - according to (12) - the 
KKR matrix is basically the two-center TB Hamiltonian, may not like the 
thought of having to differentiate its inverse, the Green matrix, with respect 
to energy. (The computer seems to work well with the formalism based on the 
Green matrix) . Such readers might therefore prefer an NMTO formalism in terms 
of kink matrices. For a discrete mesh many ugly relations exist, but the one re- 
lation which is conceptually pleasing is the following: 

0 = (89) 

Ko + K [01] (aW -eo)+.. + K [0..1V] - sn-i) ■■ - eo) , 

because it looks like the matrix form of the seeular KKR equation: \K (e)] = 
0. This relation may be obtained by taking the A^th divided difference of the 
equation: K (e) G (e) = 1, using the binomial expression (150) for a product like 
in (88), but with K (e) substituted for cp{e,v ) , and multiplying the result from 
the right by G [O...A^] . To find the transfer matrices from (89), we may solve 

for — So and do recursion starting from N=l. The results are: 

-eo = -A:[01]^^ Ao ^ -k~^K, 

£^(2) -eo = - (^A[01] + A[012] -ei)) \o (90) 

^ -(k - kk-^K/2^ ^ K, 

a.s.o. These low-iV expressions are reasonably simple. For N=l, the discrete 
form is seen to be identical with (74) and, for a condensed mesh, it reduces 
to the well-known expression for the 3rd-generation LMTO. We conclude that 
the energy matrices, E'^’^\ are well-behaved functions of the kink matrix and 
its divided differences, up to and including Mth order. With M increasing, the 
corresponding expressions for E^^~^ however become more and more compli- 
cated. The simplest expression for E^^'^ is therefore (81), the one which uses 
G-language. 
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3.2 Variational NMTO Method 

The NMTO set has been defined through (69) and (70) in such a way that 
its leading errors are proportional to (e — £q) .. (e — Eat). By virtue of the va- 
riational principle, solution of the generalized eigenvalue problem (5) with this 
basis set will therefore provide one- electron energies, Si, with a leading error 
oc (e, — £o)^ .. (£i — £jv)^ • The error of the wave function will of course still be 
of order (£j — £q) .. (£j — £at) , but that is usually all right because, as mentioned 
at the beginning of the present section, the MTO scheme is based on the facto- 
rization: 7 (£, r) = (f> (£, r) G (£) , where (j) {e, r) has a smooth energy dependence 
and G (£) provides the poles at the one-electron energies. 



Hamiltonian and overlap matrices. For a variational calculation, we need 
expressions for the NMTO overlap and Hamiltonian matrices, | 

\'H\ X*'^^)' From (69), the Nth. divided difference of the contracted Green 
function (58) is: 



{[0..N] , r) = x^^^ (r) G [0..N 



N 



(r) G„ 



n=0 Ylm=0,^n ^rn) 



(91) 



and using now (63), we obtain for the integral over the product of the Mth and 
Nth divided differences of contracted Green functions: 



h [0...MI I 7 [0....iV|) = f t — 

n-On'-O 

(M+iV+1) 

= -G\\0...M]..N] ^ (92) 

^ ^ (M + V + 1)! ^ ^ 

This is simply the negative of the (M -|- + l)st Hermite divided difference 

(152) of the Green matrix, as proved in Eq. (160) in the Appendix! 

Note that the meaning of a matrix equation like (63) is: 



{iRL (e«) I XR'L' (sn')) = -Grl,r'L' [n,n'] 

= -Grl,R'L' [n' ,n] = {xrl (£«') | IR'U (£«)) • 

In matrix notation, that is: (x„ | 7 „/) = {xn' \ In ) , and not: (x„ | xw) = {xw \ Xn)* ■ 
Even without the symmetry of the matrix G [n, n'] with respect to the exchange 
of n and n' , it is of course always true that 



{XRL {£n) I XR'L' (£n')> = ilR'L' (Sn') | XRL {Sn))* , 

i.e. that a matrix like {xn \ Xn’) is Hermitian: {xn \ Xn') = {xw \ 7n)^ • The 
point is, that n is an argument - not an index - of a matrix. Similarly, N and M 
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are not matrix indices in (92). Since the first expression (92) is symmetric under 
exchange of N and M, because G [n, n'] is symmetric, we may choose M < N, 
and this has in fact been done in the second expression. 

From (79) and (92), we now see that the Hamiltonian matrix between the 
Al^th divided differences of contracted Green functions becomes: 

(7 [0...N] |-H - £at| 7 [0...N]) = (7 [0...fV] | 7 [0..N - 1]) 

(2N) 

= -G[[0..N-l]N] ^ (93) 

Hence, we have arrived at the important results: The NMTO overlap matrix may 
be expressed in terms of the A^th-order divided difference and the {2N + l)st 
Hermite divided difference of the Green matrix as: 

= -G[0...Af]“^G[[0...iV]] G[0...Af]“\ (94) 

where the - even simpler - result for a condensed mesh was quoted in the Over- 
view (20). The Hermite derivative G [[0, ..., A^]] is thus negative definite. The 
NMTO Hamiltonian matrix may be expressed analogously, in terms of a 2N^th- 
order Hermite divided difference: 

(x^^^ \'H - = -G [O...N]^^ G [[O..A^ - 1] Af] G [O...N]^^ . (95) 

Here again, the result given in (20) for a condensed mesh is even simpler. The 
NMTO Green function is 

G |0...iV] {G |[0..« - 1| «] - (z - ew) G ||0...«]|}“‘ G [0.,.W| 

Expressions (94) and (95) for the NMTO overlap and Hamiltonian matrices 
are not only simple and beautiful, but they also offer sweet coding and speedy 
computation. For a crystal, and transforming to k-representation, one may even 
use the representation of contracted Green functions where the overlap and Ha- 
miltonian matrices - according to (92) and (93) - are merely — G [[O...N^]] and 
— G [[0..N — 1] N] . In Section 4 we shall see that an energy-dependent linear 
transformation of the kinked partial waves does not change the Hilbert space 
spanned by an energy-independent NMTO set - but only the individual basis 
functions. Therefore, we might also use kinked partial waves (e, r) and the 
Green matrix G“ (e) with phase-shift normalization. 

In summary: The variational NMTO scheme requires computation of the kink 
matrix and its hrst energy derivative at the N +1 mesh points. It delivers energies 
and wave functions which are correct to order 2N + 1 and N , respectively. This 
lower accuracy of the wave functions is appropriate because the kinked partial 
waves arc rather smooth functions of energy. For the computation of the 9„’s 
entering Kn = a (^Bn — c)„ j , radial normalization-integrals should be used. 
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As an example, for the LMTO method, the Hamiltonian and overlap matrices 
are respectively: 

= -G[01]"^ G[[0]1] G[01]“^ 

= (£o-ei)(Go-Gi)-^ (-Go + G[01]) (Go-Gi)-^ (96) 

^ -G"i^G-i = -K + KK-^^k-^K. 

2 ! 2 ! 

and 

(x« I = -G[01]-i G[[01]] G[01]-i 
= (Go-Gi)-' (-Go + 2G[01] -Gi) (Go-Gi)-' (97) 

^ = k - Kk-^% -%k-^K + Kk-^^k-^K. 

3! 2! 2! 3! 

The result for a condensed mesh in terms of the kink matrix and its first three 
energy derivatives is seen to be almost identical to the one (16), which in pre- 
vious LMTO generations required the ASA. To get exactly to (16), one needs to 
transform to the LMTO set: x*^^^ (r) = x*-^^ (r) which in fact corresponds 

to a Lowdin orthonormalization of the Oth-order set. We shall return to this mat- 
ter in Sect. 6. From the above relations we realize that - even for a condensed 
mesh and N as low as 1 - G-language is far simpler than /L-language. 



Orthonormal NMTOs. In many cases one would like to work with a repre- 
sentation of orthonormal NMTOs, which preserves the i?L-character of each 
NMTO. In order to arrive at this, we should - in the language of Lowdin - 
perform a symmetrical orthonormalization of the NMTO set. According to (94) 
such a representation is obtained by the following transformation: 

^(N) (r) ^ ^(N) (j,) g V-G[[0...A]]^\ (98) 

because it yields: 

I = -V-G[[0...fV]]^''G[[0...fV]] V-G[[0...iV]]'' = 1. 

Note that this means: — G[[0..A]] = \J —G [[O..A^]]^ -\/— G [[O..A^]]. In this ortho- 
normal representation, the Hamiltonian matrix becomes 

\n - en\x^^'^) = -x/-G[[0...fV]]''^ X (99) 

G [[O..A^ - 1] N\ V-G[[0...A^]]^\ 

To find an efficient way to compute the square root of the Hermitian, positive 
definite matrix — G [[0...A]] may be a problem. Of course one may diagonalize 
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the matrix, take the square root of the eigenvalues, and then back-transform, 
but this is time consuniing. Cholesky decomposition is a better alternative, but 
that usually amounts to staying in the original representation. Lowdin orthogo- 
nalization works if the set is nearly orthogonal, because then the overlap matrix 
is nearly diagonal, and Lowdin’s solution was to normalize the matrix such that 
it becomes 1 along the diagonal and then expand in the off-diagonal part, O : 

VTTO~^ = I --0 +-0^ - ... (100) 

2 8 

This should work for the NMTO overlap matrix (94) when the NMTOs are nearly 
orthogonal, but it hardly works for —G [[0...7V]] . There is therefore no advantage 
in pulling out the factor G[0...Af] , on the contrary. The other way around: In 
order to take the square root of —G [[O...A^]] , we should find a transformation, 
T, such that T^G [[O...N]]T is nearly diagonal, and then perform the Lowdin 
orthonormalization on the latter matrix. We shall return to this problem in 
Sect. 5. 



One-orbital model: switching behavior of and the varia- 

tional energy. Our development of the NMTO formalism has been focused 
on its matrix aspects and, through the introduction of energy matrices and by 
pointing to the correspondence with classical Lagrange and Newton interpola- 
tion of the energy-dependent kinked partial waves, we have tried to make the 
reader accept the seemingly uncomfortable fact, that the quantities of interest 
do arise by energy differentiations of a Green matrix. 

Let us now illustrate the Green-function aspects by considering the 1x1 
Green matrix (77) for one, normalized orbital: (r) = 'Pj (r) with | ^ = 

1. Note that in this model, j runs over the one-electron energies, which is a dif- 
ferent set - with much larger spacing - than the energy mesh whose points 
are denoted n and m. For a crystal, and using Bloch-symmetrized NMTOs and 
Green matrices, x^’^^ (k, r) and G (e, k) , this would be an s-band model with j 
being the radial quantum number. We want the NMTO to describe the i-band 
and therefore choose the mesh between Si-i (k) and Si+i (k) . In the following 
we shall drop the Bloch vector and not necessarily consider a crystal. 

We first demonstrate how - in this case a 1 x 1 Hamiltonian (see 

Sect. 5) - expressed in terms of ratios of energy derivatives of a Green function, 
with its singTilar behavior, produces correct results for the one-electron energy 
and how, when the mesh is swept over a large energy interval, switches 

between bands with different radial quantum numbers. From (81) and (78) we 
get: 
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Hence, for the model and an energy mesh with N + 1 points, equals Si 

to order N, with an error proportional to (e, — £o) (si ~ ^n) , which for a con- 
densed mesh becomes (e, — Si,) . In Fig. 6 we show (s,^) for = 1 to 

6, computed from the above expression for a four-level model with ej = 0, 1, 2, 
and 3, and a condensed mesh. We see that (s^) behaves as it should: It 

switches from one level to the next, with the plateau around each level flattening 
out as N increases. For N odd, the switching-curve is step-like and, for N even, 
the switching is via — (X) — >■ -l-oo. This comes from the ability of the denominator 
in the expression for to be zero when A"-|- 1 is odd. An energy- independent 

orbital, as considered in the present model, can of course only describe one band. 
With the NMTO defined for a mesh condensed onto a chosen energy we want 
to describe the band near 6i, as well as possible - also if the distance to the next 
band is small - and with a result which over a large region is insensitive to the 
choice of In a multi-orbital calculation, we should fold down those channels 
which are switching in the energy range of interest into the screened spherical 
waves. This will remove schizophrenic members of the NMTO set and prevent 
the possible occurrence of ghost bands. 

In the one-orbital model, the estimate of a true, normalized wave function, 
(f>{si,r), is the Nth-order mufhn-tin orbital: (r) = Yin 4>n (r) If we 

now use (77) and (78) to evaluate expression (73) for the Lagrange weights, we 
find: 
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where l^'^ (e) is the Lagrange polynomial (146) of degree N . We have therefore 
reached the conclusion that - in our orthonormal model, and to leading order - 
the wave function is the energy-dependent MTO, (j> (e, r) , Lagrange interpolated 
over the (N+l)-point mesh. 

Since the error of an NMTO set is of order N+1, use of the variational 
principle will reduce the error of the one-electron energies, from that of the 
highest transfer matrix, — e^, to order 2(N+1). The variational energies 

are thus correct to order 2N+1. For a condensed mesh, this also follows trivially 
from (94)-(95), which show that the variational energy, with respect to is: 



in - £1.1 

{aW I xW) 
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The odd-ordered switching curves {e^) , and {sn) shown in 

the left-hand panel of Fig. 6 are thus the variational estimates resulting from 
the use of respectively the 0th, 1st, and 2nd-order NMTO, that is, the MTO, 
the LMTO, and the QMTO. These curves are well behaved. 

The expression for the variational energy in the one-band model can be eva- 
luated exactly, also for a discrete mesh, and yields a transparent result. We use 
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the double-mesh procedure explained in the Appendix after (152), and let the 
differences e„ = En+N+i — £n shrink to zero. From (78) we then get: 
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and for the variational energy (99): 



1^ - ejvl = (£i - £ n ) 
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which of course agrees with the variational principle. 





Fig. 7 . Minimal-basis LMTO energy bands (dashed) of GaAs for two different choices 
of the screening- radii compared to the exact KKR band structure (solid). In the left- 
hand panel all screening radii were ~ 0.8t, while in the right-hand panel the Ga d 
radius was reduced to the radius of the Ga 3d core [24]. See text. 



Treating semi-core and excited states: GaAs. An accurate description of 
the cohesive properties of GaAs requires a good band-structure calculation of the 
five Ga semi-core, the As 4s^-band, and the three As Ga Asp^ valence 
bands. If also the four lowest conduction bands must be described, one is faced 
with the problem of computing a band structure containing extremely narrow 
as well as wide bands over a 20 eV-region. To do this ab initio with a minimal 
Ga spd As sp basis set (13 orbitals per GaAs), has hitherto not been possible. 
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With 1st and 2nd-generation LMTO-ASA methods one would normally use 
Al-depcndeiit e^’s and employ a 36-orbital-per-GaAs basis, consisting of the spd 
LMTOs centered on the Ga, the As, and the interstitial sites in the zincblende 
structure. The conduction-band errors arising from the choice k^=0 are so large 
that the combined correction is needed. Downfolding works for the p and d 
orbitals on the two interstitial spheres, but not for the interstitial s and the As 
d orbitals. With the 3rd-generation LMTO method, downfolding works much 
better, but the energy window is now screening dependent, and the use of Rl- 
dependent e^’s is avoided because it messes up the formalism. 

In Fig. 7 we show - in full lines - the exact (up to 7eV) LDA band structure 
calculated by the screened KKR method, i.e. by the 3rd-generation LMTO me- 
thod using many energy panels and the Ga spd As sp basis. The five Ga 3d^° 
semi-core bands arc at - 15 cV, the As 4s^-band is around - 12 cV, and the three 
As Ga Asp^ valence bands extend from - 7 to 0 eV. Above the gap, there 
are the four As 4p^ Ga Asp^ conduction bands. The dotted lines give results of 
3rd-generation LMTO variational calculations with a condensed mesh and an Si, 
in the middle of the three valence bands. In the left-hand 
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Fig. 8. Mean error in each of the three types of occupied valence bands in GaAs 
calculated with the LMTO and QMTO methods as a function of the expansion energy 
El, for a condensed mesh [24]. See Fig. 7 and text. 



figure, the screening-sphere radii for the active Ga spd and As sp channels were 
chosen at the Ga and As default values, respectively 0.82t and 0.78t, where t is 
half the nearest-neighbor distance. We see that the entire valence-band structure 
is distorted by hybridization with Ga d ghost bands. The dotted bands in the 
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right-hand figure result after changing the Ga d screening-sphere radius to 0.35t, 
which is close to the aetual radius of the Ga 3d eore. Now, the band structure 
looks reasonable: The valence bands near are perfect, but the Ga 3d bands 
are nearly 0.5 eV to high [24]. 

That the variational LMTO method with a minimal basis and a single e^, 
cannot describe all occupied states of GaAs with sufficient accuracy, becomes 
even more obvious from the left-hand side of Fig. 8, where we show - as functions 
of Si, - the average errors of the five Ga 3d bands, those of the As 4s band, and 
those of the three valence bands. The error cx (£, (k) — of the variational 
energy is clearly visible for the narrow Ga 3d and As 4s bands. With e,y’s in 
a narrow range around - 11 eV, the variational error in the sum of the one- 
electron energies gets down to about 250 meV per GaAs. On the right-hand 
side, we show the same quantities, but obtained with the QMTO method. Now 
the errors oc (k) — are acceptable, and there is a comfortable range of 
£^’s around - 10 eV where the error in the sum of the one-electron energies 
does not exceed 25 meV per GaAs. The screening-sphere radii chosen in these 
calculations [24] were: 0.93t, 1.05t, and 0.35t for respectively Ga s, p, and d, and 
0.89t and l.OOf for respectively As s and p. 



LMTO QMTO 




Fig. 9. Like Fig. 8, but calculated using discrete meshes and as functions of the position 
of the last energy point. The first energy points were fixed at the positions indicated 
on the abscissa [24]. See text. 



In Fig. 9 we show the same kind of results, but this time obtained with 
the discrete (Lagrange) LMTO and QMTO methods. The size of the basis set, 
the screening-sphere radii, etc., were as in Fig. 8. For the LMTO method, £q 
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was fixed at the position of the Ga 3d bands and the figure shows the result 
of varying the position e\ of the other mesh point. The quadratic dependence 
on £i of the variational energy-error oc {Si (k) — Sof [Si (k) — Eif is clearly re- 
cognized. Compared with the results of the tangent LMTO method shown in 
the previous figure, those of the chord-LMTO are far superior: With £i’s aro- 
und - 5 eV, the variational error in the sum of the one-electron energies gets 
down to about 30 meV per GaAs, and yet, for N given, the method employing 
a discrete mesh is computationally simpler than the one employing a conden- 
sed mesh. On the right-hand side of the figure, we show the QMTO results 
as functions of £ 2 , with £q fixed at the Ga 3d, position, and £1 at the As 4.s 
position. Here again, the quadratic dependence on £2 of the variational energy- 
error oc {si (k) — £ 0 )^ (£i (k) — £ 1 )^ {si (k) — £ 2 )^ may be seen. We realize, that 
with this discrete QMTO method, mcV-accuracy for the sum of the onc-elcctron 
energies can be reached. 

Finally, in Fig 10 we show the GaAs band structure in a wide (40 eV) range 
around the gap. Further conduction bands now appear above 7 eV and we needed 
to employ a basis consisting of the Ga spd As spdf 2E s QMTOs. £q was chosen 
at the Ga 3d position, £1 near the gap, and £2 10 eV above the gap. The results 
of this discrete QMTO calculation shown by the dotted curves agree superbly 
with those of a multi-panel LMTO (=KKR) calculation shown in full line [24]. 
This proves the power of the 3rd-generation NMTO method. 



Massive downfolding: CaCu02. An increasingly important field of research 
is the electronic structure of real materials with strongly correlated conduction 
electrons. Within a given class of materials, fine-tuning of the interesting pro- 
perties will require detailed knowledge of the single-electron part - the orbitals, 
hopping integrals and basic on-site terms - of the correlated Hamiltonian. In the 
previous review [20] of the drd-generation 0th- and Ist-order differential MTO 
method, we demonstrated for the idealized high-temperature superconductor, 
CaCu 02 with dimpled CUO 2 planes, how one could extract low-energy, few- 
band Hamiltonians by massive downfolding; in the extreme limit; Downfolding 
to one Gu dx 2 _y 2 orbital per Gu site [22,23]. Let us now reconsider this example 
in the light of the new NMTO methods. 

In Fig. II the full lines in all four parts show the (same) full LDA band 
structure in a ±3 eV region around the Fermi level, which for the doping levels 
of interests would be near the energy —0.8 eV of the so-called extended saddle- 
point at X. The conduction band has mostly 0-Gu anti-bonding pda-character 
(O Px - Gu dx 2 -y 2 ) with the bonding partner lying 10 eV lower in energy. The 
bottom of the conduction band is seen to cross and hybridize with a multitude of 
O-Cu pc?7r-bands lying below - 1.2 eV. The top of the conduction band hybridizes 
strongly with a broad O-Ca bonding pdn (O px - Ga dxy) band near A. In 
this situation, one clearly does not want to use the rather ill-defined and very 
long-ranged Wannier orbital for describing the low-energy electronic structure. 
Rather, one wants an orbital which describes the band (including its dependence 
on other relevant low-energy excitations such as spin-fluctuations and phonons) 
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I 1 r 

L r X 



Fig. 10. Energy bands of GaAs calculated with the QMTO method and the energy 
mesh indicated on the right-hand side (dashed) as compared with the exact KKR result 
(solid) [24]. See text. 



in the ±200meV range around sp, that is an NMTO with all channels, except 
Cu dx 2 -y 2 , downfolded and with as short a range as possible. The four dotted 
bands shown in each of the sub-figures result from such calculations [24]. In all 
cases, the screening-sphere radius of Cu was taken to be 0.62t. The upper 

figures illustrate a problem with the 3rd-generation tangent LMTO method: If 
El, is taken where we want it to be, at the —0.8 eV saddle-point deep down in the 
anti-bonding pdcr-band, then the method develops a schizophrenia near the top 
of the band, above 1 eV and near M, which is apparently sufficiently far away 
from that the LMTO ’might consider’ describing the bonding rather than the 
anti-bonding state. 

The resulting orbital has very long range due to the high Fourier components 
caused by the schizophrenia and, as a result, we are forced to take at a higher 
energy than we actually want. With e^= - 0.3 eV, we still get long range as 
seen in the Tipper left-hand figure, and in order to cure that problem we need to 
go to £^=-1-0.3 eV, but then the description of the bottom of the anti-bonding 
band, the extended saddle-point in particular, has substantially deteriorated. In 
the lower left-hand figure we have now switched from the tangent to the chord 
LMTO, and that is seen to help considerably. Finally, the lower right-hand hgure 
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Fig. 11. Conduction band of CaCu02 calculated by massive downfolding to a single 
Cu NMTO (dotted) compared with the full band structure (solid) [24]. See 

text. 



presents what might be called an ’overkill’: We have used the discrete CMTO 
(7V=3) method, and the agreement with the exact result is superb. 

Using integrals of divided differences of MTOs. In all previous derivations 
of the variational LMTO method, the LMTO was expressed as a matrix Taylor 
series (1) and the Hamiltonian and overlap matrices (7) were worked out using 
expressions (12) for | ^ and | . 

The same may be done for the general, discrete NMTO method, although the 
number of terms in the resulting series increases quadratically with N . For this, 
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we first use a divided-difference form - such as (88) - for the NMTO and then 
need expressions for the overlap integrals, (^[O..A^] | 0[O..M]) , and Hamiltoni- 
ans, (0 [O..A^] \'H\ <f) [0..M ]) , between divided differences of kinked partial waves. 
Since expressions (62) and (63) are formally equivalent, we find that, analogous 
to (92), 



(0[O..M] I 0[O...iV]) = {<f)[0...N]\ (t)[0..M]) = K[[0..M].N] (102) 

(M) (M+Af+l) 

a\ = ^ 

M\ j (M + AT + 1)!’ 

where we have assumed M < N. From this result for M = N, it follows that the 
odd-ordered Hermite divided differences of the kink matrix are positive definite. 
For a contracted mesh, this overlap matrix is seen to depend only on M + N. 
For the matrix elements of the Hamiltonian we must use: 
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The upper and lower results on the second line correspond to n ^ min(Af, iV). 
Here again, for a condensed mesh the Hamiltonian matrix depends only on Ai + 
N. 

The resulting expressions for | — 6n \ contain 

the above-mentioned integrals times products of (£;(^-'^+i) — ejv^_i) -matrices. 
The.se expressions are by far not as explicit as equations (94) and (95), and they 
are more complicated for a discrete than for a condensed mesh. We shall now 
consider a more useful application of (102)-(103). 



Charge density and total energy: Si phase diagram. The wave function 
obtained from a variational calculation is: ’f'i (r) = y (r) c, , where we have drop- 
ped the superscript (N) on the NMTO. The eigen(column)vector, Cj, of the gene- 
ralized eigenvalue equation (5) should be normalized according to: c\ (y | y) Ci> = 
5ii', or - regarding CRL,i as a matrix - according to: (y | y) c = 1. The charge 

density is now given by (18), which to a very good approximation is (64) with the 
energy-dependent wave functions in expressions (65)-(66) substituted by their 
matrix Lagrange or Newton series. The computer code would use the Lagrange 
form: 



p{r) =x (r) cc^X (r)^ = X] L^cc^ L^, (r)^ , 



nn' 
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so that in this case, the density-of-states matrix F (e) in (67) should be substi- 
tuted by: 




Equations (65)-(66) then become; 



1pRL,n (rfi) FiiL,n-R'L' ,n' i’ W L' ,n’ (^^R')* i (10^) 

i?i?' LL' nn' 

pI (r) = ^ n (f ) Yl, (f ) ^ (r) Fn^^n ■RL',n' ^Rl\n' (r) , 

LL' nn' 

pi (r) = ^ (f ) y£, (f ) ^ (r) FnL,n ■RL',n' ^°Rl\n' (^) ■ 

LL' nn' 



If one feels that, with the variational NMTO method, the KKR equations have 
been solved with sufficient accuracy, then one may even use (65)-(67) as they 
stand, and interpolate the energy dependences of the wave functions using the 
classical Lagrange or Newton methods (146) and (147). 

In order to solve Poisson’s equation and to compute the Coulomb- and 
exchange-correlation integrals for the total energy and forces, we need to ht the 
charge density by suitable functions. The properties of /y(r) to which wc have 
most easy access are its spherical-harmonics expansions around the various sites. 
For the fitting we therefore choose atom-centered NMTO-like functions which 
have the following advantages: (1) they are the unitary functions for continuous 
fitting at non-touching a-spheres, (2) they are localized, (3) we know the result 
of operating on them with V^, and (4) the integral of any product of two such 
functions is the energy derivative of a kink matrix (102)-(103). 

Our fitting procedure [47] can be outlined as follows: We first place a set 
of screening spheres around each atomic site. This defines our screened Hankel 
functions (29) and divides space into non overlapping intra-sphere parts and an 
interstitial part. It is not necessary to place screening spheres at interstitial si- 
tes, even though the resulting interstitial can be very large. In the intra-sphere 
region we use a spherical-harmonics expansion of the charge density, with the 
components Piji(r) known on a radial mesh. As the screening spheres are rela- 
tively small this summation can be truncated at Z=3 or 4. In the interstitial we 
expand in the screened Hankel functions, (e, r^j) , normalized as in (45) and 
with 3 different, negative energies, of which the lowest is about 4 times the work 
function, that is: 




(106) 




58 O.K. Andersen et al. 




Fig. 12. Total energy of Si as a function of the atomic volume for different structures 
calculated with the full-potential LMTO method [34] and with the present full-charge 
scheme [49,47,48]. See text. 



for all Tfl > ttfiL- With three energies, we can in principle ht continuously with 
continuous 1st and 2nd derivatives. However, in practice it is difficult to compute 
the 2nd radial derivatives of the high-1 components of the charge density. We 
therefore determine the matrix X in such a way that the fitting is continuous 
and once differentiable, that is: X = B°^ [01] ^ Bq) . The functions 
^RL {^r) (106) are those linear combinations of the three (e„,r)’s whose 

value and radial slope vanish in all channels at the screening spheres. These 
functions therefore peak in the interstitial region and their coefficients are 
determined by a least squares fit in the region interstitial to the MT-spheres, 
by sampling the full charge (105), as well as the expansion (106) at a set of 
pseudo-random points. Once the expansion is obtained, it is very easy to solve 
Poisson’s equation. In the intra-sphere part this is done numerically and in the 
interstitial analytically by virtue of the screened Hankel functions solving the 
wave equation. The same expansion procedure can be applied to the exchange- 
correlation energy density e(r) and potential /r(r). This gives a full potential. The 
total energy Etot is also easy to evaluate. The interstitial part of the integrals 
reduces simply to a summation over Hermite divided differences of the slope 
matrix. 

We have applied this procedure to look at the total energy of various pos- 
sible structures for silicon [49]. For each structure we perform a standard self 
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consistent LMTO-ASA calculation. In the last iteration an expansion of the full 
charge density is made and Etot evaluated correctly. The result is shown in Fig. 
12 where, for comparison, we show the full-potential LMTO result from Ref. 
[34]. 




linear overlap 



Fig. 13. Rms error of the valence-band energies in diamond-structured Si as a function 
of the overlap in the atom-centered MT-potential [42,43]. See text. 



Overlapping MT-potential: Si withont empty spheres. The phase dia- 
gram of Si just shown was calculated using LMTOs defined for MT-potentials 
with empty spheres. We now consider the possibility offered by Eq. (28) of allo- 
wing the atom-centered sphere a substantial overlap - like the 50% radial overlap 
shown in Figs. 2-5 - and, hence, of getting rid of the empty spheres. 

The first question is: How to construct such a potential? Our answer is [42] 
that the potential should be constructed such as to minimize the mean squared 
deviation of the valence-band energies from the ones for the full potential. From 
this condition, it then follows that the overlapping MT-potential, v (r^) , 
should be the least-squares approximation to the full-potential, V (r), weigh- 
ted with the valence charge density. This yields a set of coupled equations for 
the shape, f {r) = v (r) — g, and the zero, g, of the MT-potential. The equa- 
tion which arises from requiring stationarity with respect to 5g is of course: 
/ (H — = 0, and it means that the error in the sum of the valence- 

band energies should vanish to leading order. The other equations, which arise 
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by requiring stationarity with respect to Sf (r ) , are coupled integral equations, 
which are complicated due to the presence of the eharge-density weighting. Ta- 
king the charge density to be constant in space, corresponds to minimizing the 
mean squared energy-deviation for the entire spectrum, rather than merely for 
the valence band. Now, in our present implementation, we only took the spatial 
behavior of the charge density into account in the ^g-equation. The resulting 
potentials for diamond-structured Si were shown in Figs. 10 and 11 of Ref. [20]. 
We have recently succeeded in obtaining the overlapping MT potential from the 
full potential obtained from the charge density (106) [43], but in the present 
paper we shall only show results obtained by taking the full potential to be the 
Si+E ASA potential - like in Ref. [20]. 




Fig. 14. Band structure of Si calculated with the 3rd-generation LMTO method for the 
self-consistent Si+E MT-potential (dashed) and for the Si-centered, 60%-overlapping 
MT-approximation to it (solid). The latter calculation included the correction for the 
kinetic-energy error Eq. (28) in the LMTO Hamiltonian, and the value of the MT-zero 
was adjusted in such a way that the average energy of the valence band was correct. 
Hence, the solid band structure corresponds to the last point on the curve marked 
’ideal’ in Fig. 13 [42,43]. See text. 
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Fig. 13 shows three different results for the rms error of the valence-band 
energy as a function of the linear overlap, w = {s/t) — \. For the overlap increasing 
up to about 30%, the rms error falls in all cases, simply because the overlapping 
MT-potential becomes an increasingly better approximation to the full potential. 
Without any overlap correction, the kinetic-energy error (28), which is of second 
order in the potential overlap, initially rises proportional to v [20], and 

this is seen to limit the maximum overlap to about 30%. We may, however, use 
the LMTO equivalent [43] of Eq. (28) to correct each band energy, Si (k) , and the 
results are shown by the two other curves. The dashed curve - marked ’present 
technique’ - uses the ^^-equation as given above, whereas the ’ideal’ curve was 
obtained by adjusting g - iteratively, because g enters the 5f (r) equations - to 
have the mean error of the valence-band energy vanish exactly. It is possible to 
improve upon the ’present technique’ without knowing the valence-band energy 
a priori, and we are currently including charge-density weighting in the 5f (r)- 
equations. This makes the curve flatten out - like the one marked ’ideal’ [43]. 

The solid curves in Fig. 14 show the Si band structure obtained with the 
60% overlapping MT-potential, including the LMTO overlap correction, and 
determining g to yield vanishing mean error of the valence band. The dotted 
curve is the ’exact’ result as obtained with a (3rd-generation) LMTO calculation 
for the Si+E potential. The errors soon in the valence band are certainly no larger 
than 30 meV, but those in the conduction band are larger. 

4 Energy-Dependent Linear Transformations 

If one considers Fig. 1, it might seem as if the energy- window over which an 
NMTO set yields good approximations to the wave functions will be wider if one 
starts out from energy-dependent linear combinations of kinked partial waves: 

<^(s,r) = (j){e,v)f{e), (107) 

which have smoother energy dependencies. Normalized kinked partial waves and 
Lowdin orthonormalized kinked partial waves are examples of cases where the 
divergences of the kinked partial waves at the energies, where a node passes 
through the screening radius, arc avoided. The transformation given by the - 
in general non-Hermitian - matrix T (e) mixes kinked partial waves with the 
same energy and different RL’s linearly. Although the Hilbert spaces spanned 
by the energy-dependent sets, 4> (£> r) and (j) (e, r) , are identical, it is not obvious 
that those spanned by the respective polynomial approximations, 

^ identical, particularly not if one bears only Fig. 1 in mind. 

Depending on the transformation, the resulting tp (e, r) may completely have 
lost its original i?L-character. Since the linear combination, (f> (e, r) , of kinked 
partial waves has active radial functions on other sites, as well as at its own 
site for other L’s, it is not a kinked partial wave in the usual sense, that is, one 
which could have been obtained by a screening transformation. Remember, that 
for 3rd-generation kinked partial waves, a screening transformation is not linear. 
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In the following, we shall assume that the screening radii have been chosen at the 
previous step, in the screening calculation for the structure matrix, and perhaps 
by subsequent re-screening of the G„’s using (55). 

A further motivation for considering transformed kinked partial waves is 
that they might provide the freedom to obtain energy matrices (81) which are 
Hermitian. This would simplify the finite-difference expressions (86) and (88) for 
the XMTO so that they take the simpler form (1) which then - like in (3) - could 
be diagonalized to leading order by the eigenvectors of From expression 

(85) for the transfer matrix, we realize that the condition that a transformed 
E^^'> be a Hamiltonian matrix, is that we can find a transformation with the 
property that 



(y"> I *'"-'>) = 1. (108) 

This formalism could therefore also be the basis for obtaining an orthonormal 
NMTO set. 

Let us finally express the important equations (57)-(63) in terms of the trans- 
formed kinked partial waves: 

[Ti - e)4> (e, r) = (r) K (s ) , {TL - e) $ (e, r) G (e) = -5 (r) , (109) 

where we have defined the non-Hermitian matrices 

k{e) = K{e)f{e), G {e) = K {ey'^ = f {ey^ G {e) . (110) 

Note that these definitions do not correspond to similarity transformations. The 
kink matrix, K (e) , and thereby its inverse, G (e) , were originally defined in such 
a way that they are Hermitian, but they are inherently ’skew’, because (109) tells 
us that it is the ’one-sided’ contraction of the Green function, 

7 (£, r)=4> (£, r) G{e)=ij) {e, r) G (e) , (111) 

which is invariant. For the same reason, the integrals of the products of two 
contracted Green functions, with possibly different energies, form an overlap 
matrix. 



G(£)t (0(£) U(£'))g(0 = (112) 

which is independent of T (£) . 

Adding to the discussion following (92) about the meaning of the matrix 
equation (<))„ | ())„/) = (())„/ | ^„) , note that this equation does not hold in a 

general representation: \ <pn'~^ = TXfl7^ (^(pn' \ ^ | , 

unless Tn = Tn' ■ But it is of course always true that | = (^pn' \ ■ 

We now come to derive NMTOs from the transformed kinked partial waves 
(107). Since the arguments around expression (69) concerned the contracted 
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Green function, which according to (111) is invariant, (69) is unchanged but 
should be rewritten in the form: 



N 

(e, r) G{e) = <i> {e, r) G (e) - ^ (r) G„ (e) . 



n—0 



As a consequence, (70) should be substituted by: 

^ A^^{r)G ( A^G \ " ^ A^c^{v)G ( A^G \ 
^ A[0..A] l^A[0..A]y A[0..AT] l^A[0..A]y 



(113) 



(114) 



The last equation (114) shows that the polynomial approximation to the 
transformed energy-dependent NMTO, (e,r) = {^a)T {e ) , is 

(r) = ^ ^ [0...1V]-' , (115) 



which is a linear transformation. Hence, regardless of the energy-dependent 
transformation T (e) of the kinked partial waves, all NMTO sets span the same 
Hilbert space and all energy-windows are therefore identical. This disproves the 
above-mentioned naive conclusion drawn from Fig. 1. Since G {e) = T (e) G (e) , 
we may express the NMTO transformation (115) as a Newton series (88) for 

f{s): 



G[0...A] G[0...A]“^ = (TG) [0...N] G[0...N 



1-1 



(116) 



^^_^r[0..M] G[M..N] G[0...7V]^^ 

-eN-i) .. . 



■‘M=0 

= fo + .. +T’[0...A] 



Since the contracted Green function is invariant, so arc equations (92) and (93) 
which relate the overlap and Hamiltonian integrals of such functions to Hermite 
divided differences of G (e) . For the NMTO overlap and Hamiltonian matrices, 
we therefore obtain (94) and (95), with the prefactor substituted by G [0..A] , 

the postfactor substituted by G [0..A] , and the Hermite divided differences of 

G (e) unaltered. 

The hrst equation (114) shows that the expressions derived previously for the 
NMTOs, excluding those for integrals over NMTOs, may be taken over, after 
these expressions have been subject to the following substitutions: 

(f) {e, r) — ^ (£, r) , K [e) ^ K {e) , Ln Ln \ fliyi 

X (e, r) ^ X (£, r) , G* (e) ^ G (e) , ^ 



Remember, that the substitutions for K (s) and G (e) do not correspond to a 
similarity transformation. 

As long as we only consider T (e)-transformations which are independent 
of N , the step-down relation (80) holds for the transformed NMTOs and for 
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its transfer matrices, because the derivation merely made use of (57), which 
transforms into (109). This shows that — Sq equals —Kq = —KqTq, as 
expected, but that: | = 1 does not hold. The hatted version of (85) 

therefore only holds for > 1. For = 0 : 

\n - £o| - £o) = - eo- (118) 



The expressions for the transformed NMTO in terms of divided differences of 
transformed kinked partial waves are the hatted versions of (86) and (88). One 
should remember that the divided difference, 0([O..M] ,r) , is a linear combi- 
nation of the M + 1 functions <po (r) Tq, .., (pM (r) 7 m, and hence, a linear com- 
bination of the M + 1 divided differences: (pQ (r) , ..,<p{\Q..M\ ,r). This is the 

generalization of the property: dtp (e, r) T (e) /de\^ = p{r)T + p (r) T, used in 
the 2nd-generation LXITO formalism. Explicitly: 



M 

- sr 



r)T„ 



n=0 lim=0,5^n 



(119) 



M 

= ^ p{[m..M] ,r)f'[0..m] = p{[0...M] ,r)fo + .. + pM {v)f[0...M' 

m=0 



The transformed versions of the results (102), (103) are complicated, unless T (e) 
is independent of e. In that case, the right-hand sides iust have K (e) substituted 
by f^K {e)f = K (e) . 

Usually {p[Q..M] \ p{Q..N]^ (<^[0-A^] | (^[O..M]^, unless T (e) = f, or 

the matrix is diagonal; (p\f)..M\ \ (^[O..A^]|) = ^<^[0..A^] j pf)..M^ of course 
always holds. 



5 Hamiltonian Energy Matrices and Orthonormal Sets 

Having seen that an energy-dependent, linear transformation (107) of the MTO 
set does not change the Hilbert space spanned by the set of energy-independent 
NMTOs, but merely the individual basis functions, we now turn to the objective 
of finding a representation in which the energy matrices - but not neces- 

sarily the Green matrix G (e) — are Hermitian. The energy matrices will then be 
the two-center Hamiltonians entering expressions like (1) for the orbitals. From 
(85), we obviously want: 

E^^^ - £m = \n - em\ - Em ( 120 ) 

for 1 < M < N, and since this condition leads to the near- orthonormality 
condition (108), it guides the way to make one of the NMTO sets - let us call 
it the Lth - orthonormal. 
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In order to solve the N near- orthonormality conditions for the Hamiltonian 
matrices, we first insert the transformed version of expression (83) for the in- 
verse of the Mth divided difference of the Green matrix in terms of the transfer 
matrices and ~ £o, defined by (118), 

-G ^ ^^21) 

into the transformed version of expression (95) for the Hamiltonian in terms of 
tlie 2Mth Herniite divided difference of the original Green matrix G (e) . We 
then use (120) and notice that one factor — sm cancels out so that the 

equation may be solved for this highest transfer matrix: 



fj(M) 





' (i?(“-i) - £m-i) .. -£i) ( 


- eo) 


Sm = 


(-G [[0..M - 1] M]) Tg^^t 






1 

X 

1 

rn 

o 

1 





-1 



for M >1. Solving recursively for the transfer matrices, and including (118) at 
the top, wc obtain the following results: 

m-eo = -TJG[[]0]-'T’o 

- Si = [[ ] 0] G [[0] 1]“^ G [[ ] 0] 

~S2 = -TJg[[]0]-'G[[0]1] G[[01]2]-'g[[ 0]1] G[[]0]-'fo 

- Sm = [[ ] ... G [[0..M - 1] My^ 

( 122 ) 

where for reasons of systematics we have used the notation (154): 

G [[ ] 0] = G [0] = Go = i^g-\ 
explained in the Appendix. 

The divided differences (121) of the transformed Green matrix are needed 
for specification of the transformation via (110), the orbitals via (115), or the 
transformed kinked partial waves via (111), and are seen to be given by: 

G[0]“' = G[[]0]-'fo 

G[01]~^ = -G[[0]l]^^G[[]0]fg‘^t (123) 

G[012]^^ = G[[01]2]^^G[[0] 1] G[[]0]^^Ti) 

G [0...M]“^ = (-)“ G [[0..M - 1] M]~^ ...G [[ ] 

Since we originally had the + 1 matrices Tq-.-Tn at our disposal and have 
used N to satisfy the near-orthonormahty conditions, we have one, Tq, left. This - 
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and thereby implicitly also the other T„’s - may now be chosen equal to a matrix, 
Tq, which makes the Lth set orthonormal. Note that whereas the transformation 
T (e) did not depend on the order of any basis set, the transformation T (e) does; 
it depends on L. 

Let us first discuss whether the transformation (98) to an orthonormalized 
NMTO set may at all be arrived at by an energy-dependent linear transfor- 
mation of the kinked partial waves: According to (115), othonormality of the 
Lth set happens for any transformation T (e) which satisfies: (T^^G) [0...L] = 
(— G [[O...L]])^^^ , where G[[0..L]] is the {2L + l)st Hermite divided difference 
(152) of the original Green matrix. Hence, this is a linear equation between the 
L -|- 1 values of the matrix T (s) at the hrst L + 1 mesh points, and it is 
therefore plausible that it may be used to hx Tb. 

The better way of writing this equation is, like for the Hamiltonian matrix, 
to insert (121) for G [0..L] ^ into the transformed version of expression (94) for 
the overlap matrix. As a result: 



I - £l) ■■ - £i) - eo) x (124) 

Tq-i (-G [[0..L]]) fo-'t ('^(o) _ j ^ j j 



We see that the equation | = 1; contrast to the equation: 

(x^^^ |"H — £ m | X^^^) = — sm, is quadratic in all Hamiltonians, and the- 

refore can only be solved by taking the square root of a matrix. 

Hence, our strategy is to choose a Tq, which makes the non-orthonormality, 



X 



(L) 









(125) 



so small, that we may use an expansion like (100) to find Tq and the correspon- 
ding Hamiltonians . Of these, equals the variational Hamiltonian (99) 
with N substitTited by L, and its eigenvalues are therefore correct to order 2L + 1. 
Expression (124) now tells us that: 



y(-i)'^+bt)'^+^ (x(^) |x(^))f(-i)'^^«'' 



-‘o -'o ’ 



which may be solved to yield: 



/ : — (-1)^+' 

= + =fo 



i + id(^)-i 
1 - io(^) + 1 




(126) 



Here, the upper result is for L odd and the lower for L even. Since will 
be chosen small, and for L > 1 is usually of order (e, — ei) (e, — Eq) as we shall 
argue in (137) and (144), this transformation preserves the f?L-character of each 




Developing the MTO Formalism 



67 



NMTO. The Hamiltonian matrix (122) is seen to transform like the overlap 
matrix (124) with M substituted for L and, as a consequence, 

(-1)^ M + 1 (— 1)^ M + 1 

\/ 1 + dw -sm) + . (127) 

Similarly, from (123): 



G[0...M]"^ = G[Q...M]~^ + . (128) 

1 

A procedure for computing [1 + 0] ^ , which is more robust than the matrix 
Taylor series (126), is included in our codes [60]. 

Choosing Tq. Since the near-orthonormality conditions (108) merely fix the 
geometrical average | of successive sets, the nearly orthonormal 

scheme (122)-(124) only makes sense if the transformation Tq of the kinked 
partial waves at Sq is chosen in such a way that the non-orthonormality 
is small compared with the unit matrix. The ncarly-orthonormal scheme alone, 
does not make the orthonormalization integrals | converge towards 

the unit matrix, but make them behave like: 

/ -1 M 

This alternates with fluctuations depending on the size of (x*-°^ I X^°^) • 

The first thing to do is therefore to renormalize the MTOs in such a way 
that Tq^ ^0 = instead of (47). Hence, the first choice is: 

(129) 

where feg is the energy-mdependent diagonal matrix with elements 

(|'/’kl(^o)1 ^ = ^RL,RL i^o) = ^rl,rl{^o)- (130) 

Another choice is to start with a Lowdin orthonormalized Oth-order set: 

_ 1 

f“ = (131) 

where 0“ is the non-orthonormality of the Oth-order, renormalized MTO set: 

^ (fco“)~"^o (^o“)'"-l- (132) 

This choice therefore corresponds to taking L = 0. 
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Test case: GaAs. We have tested this orthonormalization method for GaAs 
using the minimal Ga spd As sp basis set and going all the way up to L = 3, 
that is, to a CMTO basis with the properties that 1^1 X^^^) a-nd 

(;^(3) I ^(3)^ 

= 1, SO that is a 7th-order Hamiltonian, and are of 
lower order, however, and neither of the three Hamiltonians commute. 

We diagonalized for L = 1, 2, 3 and compared with the band structures 
obtained with the corresponding non-orthonormal variational method discussed 
in Sect. 3.2. Both starting choices (129) and (131) were tried, and both gave fast 
convergence of the square-root expansions. The first choice which only requires 
evaluation of a square root at the last stage (127) but whose non-orthonormality 
0(i) 

is larger, was found to be the fastest [24]. 

Aleph-representation. The renormalization (129) is of the same nature as 
- but simpler than (due to lack of energy dependence) - the one performed 
in Subsection 2.3, where we went from phase-shift normalization to screening- 
sphere normalization. That diagonal transformation was given by (45) for the 
screened spherical waves, by (46) and (47) for the Oth-order MTOs, and by (49) 
for the KKR matrix. Since we distinguished between those two normalizations 
by using respectively Greek and Latin superscripts for the screening, e.g. a 
and a, and since it is irrelevant, whether one arrives at a nearly orthonormal 
representation from quantities normalized one-or-another way, it is logical to 
label quantities having the integral normalization (129) by Hebraic superscripts, 
e.g. N as corresponding to the same screening as a and a. Although not diagonal, 
and therefore influencing the shape of the kinked partial waves, also the Lowdin 
orthonormalization (131) is an energy-mdependent similarity transformation, 
and so is any of the following transformations: 

(j)^ (e, r) 5E (/)“ (£, r) X^^^^ (e, r) = (e, r) 

(133) 

K^{e) = A“(e) fo“ {e) = f^-'^ (e) 

with Tg arbitrary. From the latter energy-independent similarity transformation 
of G (e) , the non-Hermitian matrices and which are given in terms 

of G (e) by respectively (73) and (81), are seen to transform like: 

and fg ■ (134) 

This - (133)-(134) - has all concerned an energy-independent similarity trans- 
formation of Mfi-hatted quantities. 

In order to ensure that the hatted quantities are independent of which repre- 
sentation - a or K - we start out from, e.g. 

(£, r) = (£, r) = (£, r) f “ (e) = cj}'^ (£, r) f ^ (e) 



G^ (e) = G“ (£) = (£)^^ G“ (e) (e)”^^ = (e)^^ G^ (e) (e)“^ ^ 



and 
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where, from the latter, it follows that 

L^) = and 

it suffices to satisfy the relation: 

f‘<(£) = T’o“-if“(£) , which leads to : f^ = l. (135) 

In conclusion, under the substitution a — K, all previous equations remain valid, 
and the factors may be deleted. 

The virtue of this notation is that, once we have decided upon the normaliza- 
tion and the screening, wc can drop the superscripts: and this is what we shall do: 
Fi'om now on, and throughout the remainder of this paper, un-hatted quantities, 
i.e. the kinked partial waves, the kink and the Green matrices, and the Lagrange 
and energy matrices, are all supposed to have the integral (ortho)normalization 
(129) or (131), that is, they are all in the Aleph-representation. All equations 
derived previously are then unchanged, and Tg may be dropped. 



Accuracies of Hamiltonians. The accuracies of the Hamiltonians depend 
on the sizes of the corresponding non-orthonormalities. Specifically, since the 
residual error of the one-electron energy after use of the variational principle (5) 
for the set (r), 

= SiVi + (£i - £m) , 

is proportional to (ei — eg)^ .. (£i — £m)^ , neglect of the non-orthonormality, 
leads to the error: 

^ ^ + Cl|(£i-— £g) ..{Ei—EM) (136) 

where = vjO^^'^Vi and O means at the order of. The goal should thus be 

to reduce the non-orthonormality to: 

q{M) ^ 

because in that case, the error from non-orthonormality will be of the same order 
as that of the residual error. This can usually only achieved for M = L. 

The order of the non-orthonormality may be found by use of the difference 
function: 

(r) - (r) = $ ([01] , r) j ([012] , r) 
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obtained from (88) and where we should take = 0 if m < 1. As a result: 

q{M) ^ ^^(M) I ^(M) _ ^(M-1)^ 

= ^00 I 0 [ 01 ]) (137 

+ - eo) (<^ [01] 1 ^ [01]) (#(“-!) - £i) - £o) 



_ j'jj(M-l) _ 



which is usually of order — £i) — £o) when M > 1. 

To evaluate integrals like I ^[01]) we must transform to the original 
representation using (119) and then use (102). In this way we get: 

(00 I 0 [01]) = (00 I 0 [01]) + (00 I 0i) f [01] = K [[0] l]+K [01] f [01] . (138) 

Remember, that we are using the Aleph- normalization (133), because this influ- 
ences the right-hand sides. For a condensed mesh, (138) reduces to: 

^010^ = ( 0 1 0 ) + Fff = I + kf. 

We shall conclude this study of the accuracy of the Hamiltonians in Eq. (145) 
below. 



6 Connecting Back to the ASA Formalism 

What remains to be demonstrated is that the NMTO sets, (r) , (r) , 

and (r) , of which the two former are based on Lowdin-orthonormalized 
kinked partial waves at the first mesh point (131), and the last corresponds 
to the L= 1-set being orthonormal, are the generalizations to overlapping MT- 
potentials, arbitrary N, and discrete meshes of the well-known LMTO-ASA sets 
given in the Overview by respectively (1), (8), and (9). 

Since in the present paper we have not made use of the ASA, but merely 
a MT-potential - plus redefinition of the partial waves followed by a Lowdin- 
orthonormalization - we merely need to show that the formalism developed 
above reduces to the one given in the Overview for the case A=1 and a condensed 
mesh. In order to bridge the gap between the new and old formalisms, a bit more 
will be done though. 

N — 0, L — 0. For the Oth-order set we have: 

x(0) (j.) = y(0) (r) = 00 (r) = 0o (r) . 
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All un-hatted quantities in the present section will correspond to using kinked 
partial waves, transformed to be ortlionorinal at this hrst mesh point, eg. That 
is: All un-hatted quantities are in the Aleph-representation (133)-(135) with Tq 
given by (131). In this representation all previously derived relations hold, and 
in addition: 

To = 1 and Kq = 1- (139) 

Relating back to the Overview, this means that instead of the ASA-relation 
(13), we have (133) with Tq given by (131). The latter is the proper definition of 
Kq now that Kq = {(f)Q \ <Pq) is no longer diagonal. We now see that the un- 
hatted quantities used in the Overview were, in fact, in the Aleph representation. 
The overlap and Hamiltonian matrices for the Oth-order set are thus: 

(x^°^ I = (<(>o I 4>o) = (x^°^ I X^°^) = <^(^0 I <^o) = 1 

(x^°^ \n - eo|x^°^) = (x^°^ \n - £o| X^°^) = - £o = -Ko, (140) 

and with the Oth-order set being orthonormal, the Hamiltonian is variational. 
Hence, is the first-oidei, two-center, TB Hamiltonian of the 3rd- 

generation scheme. 



N — 1, L — 0. For the LMTO set we have: 

(r) = 00 (r) + 0 ([01] , r) - Eq) ^0 (r) + 0 (r) , 

where - as given by (90) - is seen to become the Hermitian, first-oidei 
Hamiltonian given by (140) if the mesh condenses. This proves (1). 

The Hamiltonian and overlap matrices were given in respectively (96) and 
(97), and using now K = 1 together with (102), we see that for a condensed 
mesh 

^ = -K + K^K 

= - £, + - £,) (0 I 0) - e ,) 

and 

(V'lx™) ^ + 

= 1 + - £,) (0 I 0 ) + (0 I 0 ) - e.) 

+ ( 010 ) 

which are exactly (7). Merely ^0 | 0^ is not a diagonal matrix of radial integrals 
like in the ASA. 




72 



O.K. Andersen et al. 



The nearly orthonormal LMTO set is: 

(r) = $o{r) + ([01] , r) - £o) , 

and the two conditions: | = 1 = (x^^^ I ; therefore lead to: 

(^4> [01] \ 4>o'j = 0 = (^$0 \ $ [01]^ , and | = 1 = (^4>o \ <Pi^ ■ 

Of these matrix equations, the first means that any 4 >rl {[01] , r) is orthogonal to 
any 4 >r'l' (£o,r) . As a consequence, the leading term of the non-orthonorniality 
(137) vanishes. The non-orthonormality of this LMTO set is then: 

6(1) = - £o) [01] I 0 [01]) - £o) , (141) 

which by use of (136) shows that the errors of the ff(i)-eigenvalues are: 

RS (^[01] I (^[01]) (£i - £i) (£i - £o)^ . (142) 

This is one order better than the error oc (£j — £q)^ obtained by diagonalization 
of but one order worse than the error cx (£j — £i) (£j — £q) obtained 

variationally using the LMTO set. Hence, is a second-order Hamiltonian. 
From (122): 

r ■■ 1 

7L(i)-£i = -GoG[[0]l]-iGo ^ -G I G = 

(^1 - i-K) = [l + (hW - s,) I <).)] (h(°) - £.) , 

which for a condensed mesh is exactly (8). 

For the transformation (115) from the x to the x-set, we get by use of (123): 

G[01]G[01]“i = -G[01]G[[0] 1]"1 Go 

r -1 r .. -1 -1 

^ -G I G = G^ I G = [i + (<).|<).)(h(°)-£,)]^' 

which - since from (102): \ (pj = (^cf) \ (pj - is exactly (8). 

The transformation (119) of the kinked partial waves is most easily found 
by using the orthogonality of (po (r) and <)>([01] ,r) together with (138). For a 
condensed mesh, the result is simple: 



P{r) = 0(r)+<^(r)r = <)> (r) - (r) ^(() | ^) = (r) - (^ (r) ^ , 
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and well known - see Eqs. (8) and (12). For a discrete mesh, things look more 
complicated in itl-languagc: From (138), 

f[oi] = -ii:[oi]“^ii:[[o] 1] = -j^[oi]~^ i-k[oi] ^ 

So — Si 

where the 2nd equation has been obtained by use of (154): F [[0] 1] = , 

together with: Kq = 1- For (119) we thus obtain: 

^([ 01 ],r) = ())([ 01 ],r) + 01 (r) f [ 01 ] 

= 0([Ol],r)(^l + (ei -£o)t[01]j + 0o(i’)F[Ol] 

= 0([Ol],r)X[Ol]-^ + 0o(r)f [01] 

= {0([Ol],r) + 0o(r)f [01]K[01]}/F[01]-' 

= {0([Ol],r) - 0o(r)i^[[O]l]}X[Ol]-' (143) 



where from (102): K [[0] 1] = (0o 1 4 > [01]) is iF® equivalent to the usual radial 
integral and the new factor K [01] in the transformation is caused by the presence 
of 01 (r) rather than 0o (r) on the right-hand side of the top line in (143). 

In order to complete the identification of the nearly-orthonormal LMTO 
representation for a discrete mesh with the ASA version (8) and (12), we need 
an explicit expression for the third parameter, which is the matrix entering the 
non-orthonormality (141). With the help of (143), and remembering that 0o (r) 
and 0 ([01] , r) are orthogonal, we get: 



(0[O1] I 0[O1]) = iF[Ol]-'(0[Ol] |0[O1]) 

= K [01]-' ((0 [01] I 0 [01]) - K [[0] if) K [01]-' 
= K [01]-' [k [[01]] - K [[0] l]^j K [01]-' 






r .. H 2 

K_ K 
3! 2! 



where, in the third equation, we have used (102). 



TB parametrization For tight-binding parametrizations of many bands over 
a relatively wide energy range, it is usually important to have as few para- 
meters as possible. Our experience [61,20] for the occupied and lowest excited 
bands of semiconductors and transition metals is that the off-diagonal elements 
of (00 I 0i) = K [01] , (00 I 0 [01]) , and <^0 [01] | 0 [01]^ may be neglected. This 

is in the spirit of the ASA. We therefore need to tabulate only those few dia- 
gonal elements, together with the single TB matrix . These quantities may 
then be used to construct for instance the Hamiltonian and overlap matrices 
I'M, - Si \ x^'^) and (x^'^ ] X^'^ ) . This is like in the ASA, but now, we neit- 
her need this approximation nor a condensed mesh. 
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TV > 1, L — 0. The nearly-orthonormal QMTO set is: 

(r) = ^0 (r) + |(^ ([01] ,r) + $ ([012] , r) - £i j | - £q) 

with the non-orthoiiormality: 

1 - X^'^) = (<^0 I [012]) - £i) + 

+ - £o) [10] I 0 [01]) „ ^(1)^ ^ ^ 

This - together with (136) - shows that the eigenvalue errors of are: 

^ ((j)o\$ [012]) (£i - £ 2 ) {si - £ 1 ) {si - £ 0 ) , 

which means, that is a second-oidei Hamiltonian like but different 

from it. In general, for TV > 1, the leading non-orthonormality is: 

6W « (<^0 I [012]) - £ 1 ) (^TVW - £ 0 ) , (144) 

as seen from (137). This means that remains a 2nd-order Hamiltonian 

when TV > 1, and that its eigenvalue errors are: 

w (^0 I 0 [012]) (£i - £jv) (£i - £ 1 ) (£i - £ 0 ) • (145) 

This is much inferior to the variational estimate obtainable with an NMTO basis. 
Moreover, the same result would have been obtained had we started out from the 
cheaper, renormalized scheme based on (129). Hence, with the present scheme 
only the Hamiltonians TV(^) with TW ~ L, have eigenvalues which are accurate 
approximations to the one-electron energies. 

TV = 1, L — 1. We finally use the general procedure (125)-(128) to orthonor- 
malize the nearly-orthonormal LMTO set considered above. The small parameter 
- the non-orthonormality - is thus given by (141). 

The transformation from the nearly to the completely orthonormal set is 
obtained from (128), with L = TW = 1, as: 

^(1) (r) = (r)G[01]G[01]^^ = (r) 1 + ", 

which is the generalization to discrete meshes and (overlapping) MT-potentials 
of the first equation (9). The resulting, orthonormal LMTO set is: 

X^^^ (r) = + .^([01],r) -£o) , 

with the third-ovdei Hamiltonian obtained from (127) with L = M = 1 as: 

T/(^) -£i = + - £ 1 ) [1 + 0(1) ". 
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This is the second ASA equation (9). 

For the transformation of the kinked partial waves, we have from (126): 



00 (r) 



00 (r) 



1 + 0 ^^) 



and putting all of this together, we may obtain: 

0([Ol],r) « 0([Ol],r) - ^o(r) -£o) (0[O1] I 0[O1]^, 

which is a new result. Finally, we may check that: 

I X^°^) = (00 I 0o) + - eo) (0 [01] I 0o) = 

1 + 0(1) - - eo) (0 [01] I $ [01]) - £ 0 ) (00 I 0o) « 1. 



7 Outlook 

Of the new developments described above, only the use of overlapping MT- 
potentials and efficient computation of total energies and forces from TB-LMTO- 
ASA charge densities were planned. Those parts turned out to be the hardest 
and still await their completion. But on the way, we did pick up a number of 
beautiful and useful instruments. Now that we have an accordion for playing 
Schrodinger, maybe Poisson can be learned as well. 
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9 Appendix: Classical Polynomial Approximations 

Lagrange and Newton interpolation. In these interpolation schemes, a fun- 
ction / (e) is approximated by that polynomial of fVth degree, /(^) (e) , which 
coincides with the function at the A -1-1 energies, eo,£i, .., £n, forming the mesh. 
The error is proportional to {e — £ 0 ) (£ — £ 1 ) .. (£ — En) ■ 

The expression for the approximating polynomial in terms of the A + 1 values 
of the function, / (£„) = /„, with n = 0, 1, .., A, is: 

N N 

/(^) (e) = ^ fn 4^^ (e) , where 4^^ (e) = f[ ^ 

n n 



( 146 ) 
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is the Lagrange polynomial of A^th degree. It has nodes at all mesh points, except 
at the nth, where it takes the value 1. Since Lagrange interpolation is exact for 
all functions with M < N, the Lagrange polynomials satisfy the sum rules: 
= T,n=o (^) , for M = 0, 

The same approximating polynomial may be expressed as a divided differenee 
- or Newton - series: 



N 

/W{£)= ^ /[o,..,M] n (£-£„) 

M=0 



M-1 



n—0 



(147) 



— / [0] + / [0> 1] (£ “ £o) + •■ + / [0...IV] (e — sn-i) ■■ (£ ~ £i) (s ~ £o) , 



where the square parentheses denote divided differences as defined in the follo- 
wing table: 

£o fo = f [0] 

£i fi = f [1] 

£2 / 2 ^/[ 2 ] 

In general, that is: 



_ /[m,m+ l,.,n] - / [m + 1, ., n, n + 1] 

= ^ . ( 148 ) 

'^n+1 

where m < n. Note that the two energies in the denominator are those which 
refer to the mesh points not common to the two divided differences in the no- 
minator. Also, note their order, which defines the sign. A divided difference, 
/ [0...M] , is thus a linear combination of /o, /i, ..., }'m- The divided differences 
entering (147) are those descending along the upper string in the table, but other 
forms are possible. Besides, the order of the energies need not be monotonic. In 
fact, all divided differences of degree M + 1 involving M speeifie mesh points 
are identical. This means that the order of the arguments in / [0, 1, ., M — 1, M] 
is irrelevant, as may be seen explicitly from expression (149) below. When we 
have a long string of arguments, we usually order them after increasing mesh 
number, for simplicity of notation. 

We may express any divided difference, / [0..M] , entering the Newton form 
(147) as a linear eombination of the /„’s with n < M, and thereby establish 
the relation to the Lagrange form (146). To do this, we apply both Newton 
and Lagrange interpolation to a function, which we take to be that Mth degree 
polynomial, (e) , which coincides with / (e) at the first M + 1 mesh points. 
This is allowed, because / [O..Af] is independent of the /„’s with n > M. In this 
way, we get the identity: 

M m— 1 M 

y(M){^) ^ ^/[0..m] (e-e„) = X^/n4^^(e) 

m—0 n—0 n—0 
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and taking now the highest derivative, we obtain the important relation: 

M ^ 

= Eh® — ( _ 

n=0 iim=0,#n £mj 

The inverse relation, that is the expression for /„ in terms of divided differences 
for a (sub)mesh containing e„, is of course just the Newton series (147) evaluated 
at the mesh point e„. 

In order to factorize {4>G) [0...N"] in expression (70) for the NMTO, we shall 
need to express the A^th-order divided difference of a product function, f (e) g (e) , 
in terms of divided differences oir the same mesh of the individual functions. Since 
the product is local in energy, we start by expressing its divided difference in the 
Lagrange form (149): 



N 



ifg) [0...N] = J2 



/n5n 



n=0 Ylm=0,^n 

For / (e) we may choose to use the divided differences in the upper, descending 
string of the table. We therefore use (147) to express /„ in terms of the divided 
differences on the (0..n)-part of the mesh and thereafter reorder the summations: 



N N 



M-1 



ifg) [0...1V] = ^ ^ f[0-M] n iSn-Sm') 



rim=0.yn 



n=0 M—0 
N 



m'—O 

N prM-1 / _ N 



im=0,^n 



M=0 



n=0 rim=0,#n 

Since HmE - £m') =0 for n < M, 

UZr=U^n-em') ^ ^ Y{Zr2,{en-em 



E 



=0 Ylm=0.^n ^rn) 



V=E 



,=M Ylm=0,^n 



n—0 



9n 



n—M i 
N 



E 



9n 



—M i Lm—]\ 



n=M iim=M,/n 

according to (149). We have thus proved the binomial formula: 



= g[M..N], 



N 

ifg) [0...1V] =Y,f [0-M] 9 [M..N] , (150) 

M=0 

which expresses the N'th divided difference of a product on the (O...A^)-incsh as 
a sum of products of divided differences on respectively the (0..M)- and (M..fV)- 
parts of the mesh, with M being the only point in common. Hence, this formula 
is in terms of the divided differences descending forwards along the upper string 
for /, and the divided differences descending backwards along the lower string 
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for g, but this is merely one of many possibilities. For the special case: g (e) = e, 
wc get the useful result: 

{ef)[0...N] = f[0..N-l] + SNf[0-N]. (151) 

Since the numbering of the points is irrelevant, we could of course have singled 
out any of the + 1 points, not merely the last. 

Newton interpolation has the conceptual advantage over Lagrange interpo- 
lation that the 1st divided differences, / [n — l,n] , are the slopes of the chords 
connecting points n — 1 and n, and hence approximations to the 1st derivatives, 
the 2nd divided differences, / [n — 1, n, n + 1] , are ’local’ approximations to ^ 
times the 2nd derivatives, and so on, as expressed by (71). For the mesh conden- 
sing onto the one energy, e,y, Newton interpolation becomes Taylor expansion, 
which is of course simpler. An example of this is the binomial expression for the 
A^th derivative of a produet: For a discrete mesh, there are many alternatives to 
(150), but for a condensed mesh, there is only one expression. 



Hermite interpolation. It will turn out that the NMTO Hamiltonian and 
overlap matrices are best understood and computed using the formalism of Her- 
mite interpolation. Here, one seeks the polynomial of degree M + N + 1 which 
fits not only the values, /„, at the N + 1 points, but also the slopes, fn, at a 
subset of M + 1 points. We shall number the points in such a way, that the M + 1 
points are the first. This polynomial is: 



jiM+N+l) 



M 

E 

n=0 



M 



/n + fn- fnl ^ 



N 

E 



N 



(£ - £n) 



x4“) (£) e) (£) + ^ /„ 4“+'^ (£) (e) 



For those interested in why this is so, here are the arguments: The product 
of Lagrange polynomials 



M 



4""44 4^^(4= n 

m=0.7^n 









! N 

n 



e - Sn 






with 0 < n < M, is of degree M + N. At a mesh point, e = £„', this product has 
value 1 when 0 < n' = n < M, value 0 and slope 0 when 0 < n' ^ n < M, and 
value 0 when M < n' < N. Since the slope is: 



M 






im—0,^n 



e- Sr, 



N 

E 



S -Sr. 



(e) is) 
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the polynomial of degree Af + TV + 1 : 








N 

+ E 




is) is) , 



with 0 < n < M, has value 1 and slope 0 if e = e„. If £ = £„, it has value 0 

and slope 0 when 0 < n' < M , and value 0 and some slope when M < n' < N. 
The polynomial of degree M + N + 1 : 

(e - e„) (£) li^'> (e) , 

with 0 < n < M, vanishes at all mesh points, has slope 1 for e = e„, slope 0 
for £ = £„' 7 ^ £„ when n' and 0 < n' < M, and some slope when M < n' < N. 
Finally, the product: 



(e) is) = 




2 N 



n 






with M < n < N, is 'd polynomial of degree M + TV + 1 . For e = £„< it has value 
0 and slope 0 if 0 < n' < M, value 0 and some slope if Tlf < n' ^ n < N, and 
value 1 and some slope ii M < n' = n < N. 

What we shall really need is, like in (149), times the highest deriva- 

tive of the polynomial y(*f+^+i) (g). Calculated as the coefficient to the highest 
power of £, this Hermite divided difference is: 



(M+AT+l) 

y(M+W+l) 

(Af + A^+ 1)! 



M N 

fn — fn i eTEc ^ 

\ n'=0,^n ”■ n'=M+l ^ 



M 



N 



M 

E — 

n iSn - Sm)^ n (£„ - Em) 

m=0,7^n 
N 



E 



fn 



M N 



n-M+l Yl (£„ - £m) 

m—0 m—M+l,^n 



lim / [0 M + N + l\ = f [[0...TW] ..TV] . 

€— >0 



(152) 



In the last line, we have indicated that the Hermite divided difference may be 
considered as the divided difference for the folded and paired mesh: 



£o EAT+I Si En+2 • • • • Sm Sm+N+1 ■ ■ Sn 

in the limit that the energy differences, e„ = £„+jv+i — between the pairs tend 
to zero. In analogy with the notation for the divided differences, we have denoted 
the (TM + TV + l)st Hermite divided difference: / [[0...TH] ..TV] , which means that 
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the mesh points listed inside two square parentheses have both /„ and /„ asso- 
ciated with them, whereas those listed inside only one square parenthesis have 
merely /„. Like for the divided differences, the order of the arguments inside a 
square parenthesis is irrelevant, but for long strings we usually choose the order 
of increasing n. For a condensed mesh, 

(M-l-Af+l) 

f[\0...M]..N] — — (153) 

As examples of Hermite divided differences we have: 

/[[0]] = /o /[[0]1] = ^^ 

(154) 

^ [[ 01 ]] = /[[]0..iv] = /[0..Ar] 

In the NMTO formalism the Hermite divided difference (152) conies in the 
disguise of the following double sum (92): 



N M 



EE 

n—O n'—O 



n 



N 

m—0,^n 






f[n,n'] 




(155) 



which may, in fact, be viewed as a divided difference (149) - albeit in two dimen- 
sions - but that brings little simplification. So let us prove that (152) and (155) 
are identical: First of all, the /„-terms of the double sum (155) are those for 
which n = n', and they obviously equal those of the single sum (152). Secondly, 
the /„-terms in (155) are: 



N M r , X-1 f , \-l 

E \ - Jn\Sn^£n') + In' \^n' ^ ^n) 

2-^ -nN 



n=0 n'=0,^n rim=0,#n ^ Ylm= 0 ,^n' rt' £m) 

N ^ M 



E 



fn 



T, : + <i“) 



n=M+l llm=0,#n ~ n'=0 llm=0,#n' 



M , X -1 



M r 

E Jn 

T— rA^ / \ / T—rAf / \ 

n=0 llm=0./n n'=0.^n llm=0,#n' ~ ^m) 



N . s-l 

V^n ) 



M r 

In 

T— rAf / \ / ^ TT"^ ( 

n=0 llm=0.^n n'=0.#n llm=0,^n' ~ ^m) 



Now, according to (149), 



M 



n'—O llm=0,^n' '-mj 



[0...M] 



(157) 
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is the Mth divided difference of the single-pole function 1/ (e„ — e) , provided 
that n is not on the nicsh For the sum where n is on the mesh - but the 

n'=n-term is excluded - we have: 

1 M -1 



(er. 



M 1 M 

— 

/ ^ t-tM / \ / -/ T— rA/ / \ 

n'=0,^n llm=0,#n' n'=0,^n llm=0,#n,#n' V^«' ~ 

= 2 ^ + l"-^] • (158) 

(£n - e) 

This result also holds if M is named iV, and is therefore relevant for both of the 
last terms in (156). We then need simpler expressions for the divided differences 
of the single- and double-pole functions. Guided by the results: 

1 d"” 1 1 1 1 M + 1 



M\ de“ i 



' (£i - e) 



M+i ■ 






M+2 • 



for the derivatives, we postulate that for a discrete mesh, 



1 



Si- s 



[0...M] 



1 



1 



rim=0 - ^ra) {Si ~ s) 



[0...M] 



Z^n— 



n—0 €.i-€n 






(159) 



For M=0, these expressions obviously reduce to the correct results, {si — Sq) ^ 
and (si — So) ^ ■ For M > 0, our conjectures inserted on the right-hand side of 
(148) and subsequent use of (149) yield: 



^[0.,M-1]-^[1..M] 
£o ^ Sm 



1 




M 

m—0 



{Si 



£m) 



1 



Si-S 



[0...M], 



(ei-s) 






M 

E ; 

n—0 



So — Sm 



rim=0 (^i - ^rn) (Si ~ s)" 



[0...M], 



which are obviously correct too. Hence, equations (159) have been proved. 

Using finally (159) in (157) and (158), and right back in (156), leads to the 
/„-terms in (152). We have therefore demonstrated that: 



tt— =/[|0...M|..JV|. (160) 

n-On'-O ^Sn'-Sm') 

m=0,^n m'—O^^n' 

The final expression needed for the NMTO formalism, is one for the Her- 
mite divided difference of the product-function ef (e) . For this we can use (151) 
applied to the folded and paired mesh. As a result: 



(e/)[[0...M]..iV] = /[[O..M-l]..fV] + £M,/-[[0...M]..iV]. (161) 



Since the numbering of the points is irrelevant, we could of course have singled 
out any of the M + I points, not merely the last. 
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Abstract. To combine the simplicity and efficiency of atomic-sphere approxima- 
tion (ASA) based electronic structure calculations and the accuracy of full potential 
techniques, we have developed a full charge- density (FCD) method. In this method 
the charge density is obtained from the output of self-consistent linear muffin-tin or- 
bitals (LMTO) ASA calculations, the Coulomb energy is calculated exactly from the 
complete, nonspherically symmetric charge density defined within nonoverlapping, 
space-filling Wigner-Seitz cells, and the exchange-correlation energy is evaluated by 
means of the local density approximation or the generalized gradient approxima- 
tion applied to the complete charge-density. The kinetic energy is obtained as the 
ASA kinetic energy corrected for the nonspherically symmetric charge-density by 
a gradient expansion. The integration over the Wigner-Seitz cell is carried out by 
means of the shape truncation function technique, which is also discussed in detail. 
The FCD technique retains most of the simplicity and computational efficiency of 
the LMTO- ASA method, while several tests for bulk metals and surfaces show that 
the accuracy of the method is similar to that of full potential methods. 



1 Introduction 

As a consequence of the rapidly increasing computational facilities followed 
by the development of computer codes, the ab initio electronic structure me- 
thods arc able to treat more and more complicated problems, closely related 
to applications, with sufficiently high accuracy. During the last two deca- 
des the linear muffin-tin orbitals (LMTO) method [1,2, 4, 9, 3, 6, 7, 5, 8] has been 
one of the most commonly used method in electronic structure calculations. 
In particular, due to its simplicity and extreme computational efficiency it 
has been extensively used in total-energy calculations for close-packed high- 
symmetry systems where the atomic sphere approximation (ASA) may be 
applied with sufficient accuracy. However, if the local arrangements of atoms 
deviate strongly from spherical symmetry or the atoms change their positions 
away from high symmetry positions the ASA cannot be applied. Thus, alt- 
hough the LMTO- ASA may be used to calculate, for instance, the electronic 
pressure, it cannot in its conventional implementations yield forces and, if 
uncorrcctcd, the ASA breaks down when used to calculate clastic shear mo- 
duli. To increase the number of systems to which the LMTO method may be 
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applied, including systems with low symmetry, one has developed a number 
of full-potential (FP) LMTO techniques [10,11,12,13,14,15]. These techni- 
ques are of course highly accurate but lack the efficiency of the LMTO-ASA 
method. Hence, they may be used in static but not in molecular dynamics 
calculations and they cannot be used in many applications. 

According to the theorem of Hohenberg and Kohn [16], there exists a 
unique energy functional which is variational in the density. Hence, if the 
functional is evaluated with a trial density close to the exact ground state 
density, the error in the total energy is only of second order in the diffe- 
rence between the trial density and the ground state density. This variational 
property means that in many cases one can achieve the required accuracy 
simply by evaluating the total energy functional using an appropriate trial 
density and thus avoiding the most time consuming part of the calculation, 
the self-consistent iterations. In order to do this, one has to answer the follo- 
wing questions: How does one construct densities which applied in the true 
functional yield total energies of sufficient accuracy? In the context of the 
LMTO method one has the related question: How does one evaluate the true 
functional rather than the approximate ASA functional? It is the purpose of 
the present paper to provide one answer to those questions. 

In the following, we describe and test an efficient technique for total energy 
calculations based on the LMTO-ASA method in the tight-binding (TB) re- 
presentation [9, 6, 7, 8]. According to this, we use the complete, non-spherically 
symmetric charge density generated in self-consistent ASA calculations to 
evaluate the true energy functional. We have developed the new technique 
during the last years and used it successfully in many applications [17,18,19], 
[20,21,22,23,24]. In the first version of the method we used the uncorrected 
ASA kinetic energy for the total energy calculation and only the electrostatic 
and exchange-correlation terms of the energy functional were evaluated from 
a complete non-spherical charge density. In many applications this approxi- 
mation proved to be sufficiently accurate, like e.g. in studying the ground 
state atomic volumes of open crystal structures such as the a-phascs of the 
light actinides [19,20,21]. It has turned out, however, that although the ASA 
kinetic energy is often a suitable approximation, it does not, for instance, yield 
sufficiently accurate total energies for the small orthorhombic and tetragonal 
deformations needed in calculations of clastic constants [22]. Therefore wc 
improved the kinetic energy calculation beyond the ASA and thereby take 
the remaing step towards constructing the true energy functional. 

Finally, we point out that although in its present form our full charge 
density (FCD) total energy calculation is based on the conventional LMTO- 
TB method, it can be an even more promising technique in many applications 
if it is combined with the recently developed exact muffin tin orbitals method 
by Andersen et al. [25]. The new method gives more accurate interstitial 
charge density and kinetic energy, and thus the accuracy of the FCD total 
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energy can be substantially improved. The development of such a new FCD 
technique is in progress. 

The structure of the paper is the following: In Section II we set up the 
FCD total energy and in two subsections we discuss the details of the kinetic 
and exchange-correlation energy, as well as the Coulomb energy calculations. 
We put emphasis on the calculation of the kinetic energy correction to the 
ASA and the technique used in the determination of the intcrcell Coulomb 
(Madelung) energy. In Section III we describe the construction of the charge 
density in a one-center form using a general muf&n-tin orbitals formalism. 
The expressions valid on the LMTO basis are given in the Appendix C. In 
the calculation of the integrals over the Wigner-Seitz cell we used the shape 
truncation function, or simply shape function technique, which is discussed in 
Section IV. Some important calculational details faced to actual applications 
are presented and discussed through several examples in Section V. The ac- 
curacy of the FCD method is demonstrated in comparison with full potential 
and experimental results. Finally the paper is ended with the Conclusions. 

2 Energy Functional 

Within density functional theory the total energy of the system may be de- 
composed in the form [16] 



E[n] = G[n]+F[n], (1) 

where G[a] is a universal functional consisting of the kinetic energy T[n] of 
the non-interacting system and the exchange-correlation energy Exc[n], i.e., 



G[n] = T[n] + Exc[n], (2) 

and F[n] is the Coulomb contribution to the total energy 

F[n] = J v{r)n{r)dr +\jj ^ t^rdr'. (3) 

Here, u(r) is an external potential. The total charge density n(r) may be 
given by the sum 



= ( 4 ) 

R 

over lattice positions R of atomic-centred charge densities nn{rn) defined 
within space Filing, non-overlapping cells Qn, which in turn may be written 
in the one-centre form [20] 
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nR{rR) = ^nRL(rfi)y£,(ffl), (5) 

L 

where L is short-hand notation for r/j = r — R, and Yr is a real 

harmonic. These atomic centred charge densities arc normalized within the 
cells and the total charge density is continuous and continuously differentiable 
in all space. 

In the following we assume that the total energy functional, in accordance 
with (4) may be partitioned into cell-contributions, i.c. E[n] = E!R[n], 

where ErIti] = GrIu] -I- Tfi[n]. Due to the non-local character of the interac- 
tions these functionals depend on the total density (4). However, in the local 
density as well as the generalized gradient approximation to the exchange- 
correlation and kinetic energies Gr depends only on ur. 

2.1 Kinetic and Exchange-Correlation Energy 

In the self-consistent ASA based methods both the Schrodinger equation and 
the Poisson equation are solved within the spherical symmetric approxima- 
tion for the charge density and the Wigner-Seitz cell. When self-consistency 
has been reached the Coulomb energy may be evaluated exactly by solving 
Poisson’s equation for the proper charge density and atomic polyhedron. Ho- 
wever, the one-electron energies and, hence, the kinetic energy, will reflect the 
approximation used in the solution of Schrodinger’s equation. Therefore, in 
order to achieve the required accuracy of the total energy we correct the ASA 
kinetic energy. There are several reasons why a correction to the ASA kinetic 
energy of the kind presented here has not been previously attempted. First 
of all, in most LMTO calculations the electrostatic and exchange-correlation 
terms have been evaluated from a spherically symmetric charge density and, 
hence, there is no need for a more accurate kinetic energy. Secondly, the 
kinetic energy, which is obtained from the Kolin-Sliam equations [26] as 



UCC P 

= n^^\r)v^i^{r)dr, ( 6 ) 

j 

where ej are the one-electron energies, n{r) the electron density, and the 
Veff{r) the effective potential, is variational in the potential, and it has often 
been assumed that the ASA kinetic energy is in fact sufficiently accurate. 
Finally, to improve on the ASA kinetic energy one would need to know an 
explicit kinetic energy functional, e.g. in the form of a gradient expansion. 
However, in view of the relatively slow convergence of the known kinetic 
energy gradient expansions it is not obvious that this would in fact lead to 
the required accuracy. 
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The solution to this impasse is to evaluate the main contribution to the 
kinetic energy in the ASA and then apply an approximate functional form 
to evaluate the difference between the ASA and the true kinetic energy. This 
remainder is presumably small and may be obtained with sufficient accuracy 
by a gradient expansion. A similar approach based on Hartree-Fock densities 
has been used in atomic calculations by DePristo and Kress [27]. The pro- 
cedure is closely related to the modern gradient correction to local density 
functional theory and as we shall demonstrate the corrected FCD method 
has the accuracy of the full potential methods while retaining most of the 
simplicity and efficiency of the LMTO-ASA. 

In principle, the exchange-correlation energy can be evaluated directly 
from the total charge density. However, in many cases it is useful to use an 
expansion arround a uniform or spherical symmetric charge density. There- 
fore, in the following we consider the universal functional G[n] and we will 
separate the kinetic and exchange-correlation terms later. The energy density 
g, corresponding to the functional Gfl[n/j], is defined as 

Gfl[rifl] = / g{[nii],rR)drR (7) 

j 

which may, within the density-gradient approximation, be expressed as [28] 



9{[nR],rR) = t{[nR],rR) + exci,[nR],rR)nR{rR) 

= t{uR, iVnflp, ...) + exc{nR, IVn^l^, ...)nfl(rfl) (8) 

= 5(M)> 

where t and e^cn are the kinetic and exchange-correlation energy densities, 
respectively. For charge densities which deviate weakly from spherically sym- 
metry may be represented by a Taylor series around the sperically 

symmetric charge density i.e.. 



5(M) = 9{[nR\) + fiR{rR) 



dg{[nR\) 



, 1 - / ,2 d‘^9{[nR]) 

+ ^nR{rR) 



dn\ 



duR 

1 

0 2 



-I- Vhfl(rfl) 



<9.g([nfl][ 

dVuR 



-/wr. ^^2 d^9{[nR]) 












duRdS/uR 



( 9 ) 



where hfl(rfl) = nfl(rfl) — n^{rR). As a result, the universal functional may 
be expanded in the following form 



GrIur] — + ■•■> 



( 10 ) 
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which may be used to calculate the total energy provided one knows the 
energy density functions and the corresponding gradients. Unfortunately, this 
is not the case and one must resort to approximations. 

Within modern density functional theory the problem is solved, as far 
as the exchange-correlation energy Exc-,R[nFt\ is concerned, by means of the 
local density approximation (LDA) or generalized gradient approximation 
(GGA) [29] which yield analytic expressions that may easily be applied in 
conjunction with the full LMTO charge density. Thus, only the kinetic energy 
remains to be accurately evaluated. Here, the problem is that neither 
the Kohn-Sham equation (6) in the ASA nor a straight density gradient 
expansion of the kinetic energy based on the explicit analytic expressions 
given, for instance, in Ref. [28] have sufficient accuracy when used seperately. 
However, as we shall show in the following one may by a combination of the 
two techniques in the form of a density-gradient correction to the ASA obtain 
kinetic energies with the desired accuracy. 

We proceed by isolating the lowest order terms in (7 - 10) which may be 
evaluated in the ASA and the ’’small terms” which may be evaluated using a 
suitable functional form. In the ASA the kinetic energy is obtained from the 
Kohn-Sham one-electron equations in the form (6) which depends only on 
the spherical average of the charge density, because the effective one-electron 
ASA potential is spherically symmetric. Hence, viewed as a functional of 
an arbitrary density, equation (6) would give the same value for any non- 
sphcrically symmetric charge density having the spherical average It may 
therefore be identified as the kinetic energy belonging to the charge density 
Mjj. Thus, we write 



[ 4 ] « ( 11 ) 

where is the kinetic energy obtained in the ASA from a spherical sym- 

metric self-consistent calculation and the second term is a ’’small” shape- 
correction connected with the fact that the kinetic energy T^[vPp\ correspon- 
ding to the spherically symmetric charge density is defined within the 
Wigner-Seitz cell at R while the ASA kinetic energy is defined inside the 
corresponding atomic sphere. Within the LMTO-ASA method the kinetic 
energy may be expressed by means of the ASA Hamiltonian 
one-electron wave functions as [30] 



- f n^^^{rR)veff{[n^^^],rR)drR, (12) 

JSr 

where Vef f{[n^^],rR) is the effectiv one-electron potential, Sr the atomic 
Wigner-Seitz radius, and ri^®^(rfl) the ASA charge-density normalized wit- 
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hill the atomic sphere which is equivalent to inside of the cell and 

sphere. This form may include the so-called combined correction [7,30]. The 
shape-correction term in ( 11 ) may be obtained as 

A[n%n^^^]= f t{[nl])drn- [ dm, (13) 

Jqr Jsr 

where the first integral is performed within the Wigner-Seitz cell, 17^, and 
the second one within the atomic sphere. 

The shape correction and the kinetic energy part of the higher order terms 
in ( 10 ), i.e., those of first and second order in hji and Vn^, arc evaluated by 
means of a semi-local kinetic energy density functional. In Appendix A we 
present some functionals used in the actual applications. 

For strongly anisotropic electron densities, like in the case of surfaces, the 
expansion ( 10 ) is not convergent. In this case the exchange-correlation energy 
has to be evaluated by a direct three dimensional integration of the LDA or 
GGA functional, while the non-spherical kinetic energy correction, i. e. the 
higher order terms in ( 10 ), is calculated as 



/ t{[nR])drR - f t{[ri%])drR. (14) 

J ilji J flri 

This procedure is much more time consuming than the Taylor expansion (10). 

In order to show the effect of the correction terms to the kinetic energy 
we consider an orthorhombic shear deformation [22,31] of bcc Mo, which can 
be used to determine the C 44 elastic constant. In Fig. 1 we show the total 
energy, with and without the kinetic energy corrections, versus the relative 
deformation parameter d. 

The energy determined from the experimental shear elastie constant [31] 
is also shown. We observe that using the ASA kinetic energy the calculated 
C44 is negative and, hence, the bcc structure of Mo will be unstable against 
such an orthorhombic distortion. Obviously, the ASA kinetic energy is not 
sufficiently accurate to render the bcc structure of Mo stable. It is only when 
the kinetic energy correction is applied that a positive C44 is obtained which 
is in fact very close to the measured C44 value. 

2.2 Coulomb Energy 

In this section our aim is to calculate the Coulomb energy of a given charge 
distribution in a solid with arbitrary symmetry. We devide the total elec- 
trostatic contribution belonging to the cell at R into the intracell, that de- 
pends only on the electron density in the cell, and intercell terms, i.e. 



Fr[u] = E“-[ufl] + E“‘“[Q]. 



(15) 
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Fig. 1. Change of the total energy of Mo for orthorhombic shear deformation as a 
function of the relative deformation parameter d. 



The intracell energy 



[nR\ = 



I 

J Qr 



tr 




nRi^R) 

kfl - t^'r\ 




nR{rR)drR, 



(16) 



where Zr is the atomic number, may be determined by solving the 1— dependent 
Poisson equation or by numerical integration using, for instance, the shape 
function technique [2,17,32], Denoting by fiRR{rR) the Yr(tr) projection of 
on a spherical surface of radius rR that lies inside the cell and per- 
forming the angular integrations the expression (16) for the intracell energy 
can be brought to the form 



Fr'^^r] = 



+ 




■\/47r 



L 

QRL{rR) 



nRL(rR) PRL{rR) 



rRdvR 



(17) 
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where 



PRL{rR) 



21 + 1 






^RL\r^) 




{r'n) 



R)^d,r'[i 



and 



QRL{r-R) 



21 + 1 ,/o 



RRL{r'ji) 




{''''r)^^P'r 



^L,{0,0) Zr. 



(18) 



(19) 



Here S’^ stands for the radius of the sphere circumscribed to the Wigner-Seitz 
cell Hr at R, and S is the average Wigner-Seitz radius. The explicit form for 
the fiRL{rR) function will be established later using the shape functions. 

In Fig. 2 the intraccll Hartrcc energy of fee Cii is plotted relative to its 
converged value as a function of Imax used in (17). As may be seen from the 
hgiire the energy difference of 0.3 mRy obtained for Imax = 8 — 11 is reduced 
below 0.1 mRy already for /max = 12. In the actual calculations we found that 
for a wide range of structures a reasonable accuracy of the intraccll energy 
given by (17) can be achieved by performing the summation over I up to 




I max 

Fig. 2. The convergence test for the intracell Hartree energy of fee Cu. The results 
are plotted relative to their converged value as a function of the maximal I value 
used in (17) or in (43). 
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The intercell interaction energy belonging to the cell at R may be written 
in the following form [17,33] 



^inter 

R 



Q] = 

2S ^ 21 + 1 

R,' L 



bRR’ 



47t(21"-1)!! c 

X 7TTT0i",Z+Z' JL",1 

L"' 



[21 - 1)\\{21' - 1)\\ 



Yl{^rr') ^2 Qrl'^L', 

L',L" 

(R' — R + 



( 20 ) 



and it can be completely described in terms of the mnltipole moments dehned 
as 



Qrl = j dvR - Sr^(o,o) Zr 

= Qrl{S%)- (21) 

In (20) 5'i^i:,'(R) are the conventional LMTO structure constants and R' 
runs over the lattice vectors. The displacement vector hRRi has to be chosen 
in such a way that the circumscribed spheres of the cells in question do not 
intersect each other (here we assume that the directions of Ijrri and R' — R 
coincide) 



\H' --R\ + bRw > S'r + S^r,. (22) 

For the cells with nonoverlapping bounding spheres we can choose Brri = 0 
so the equation (20) reduces to the well-known form 

F“"[Q] = ^ SL,L'{Rno)QR„^L' (23) 

L R„o,i' 

where R„o runs over the cells with nonoverlapping bounding spheres. 

For neighbouring cells with overlapping bounding spheres Brri according 
to the inequality (22) has to be nonzero. Furthermore, as was pointed out 
by Gonis et al. [33], the outer sum (over 1) is conditionally convergent and 
for a fixed value of the upper limit of the suimnation over I' , 1'", it may start 
to diverge above a certain value of I thus defining a range of convergence 
of the summation. It was also shown that the range of convergence for the 
summation in (20) depends sensitively on the choice of Brri [33]. For larger 
values of Brri the sum converges more slowly, but the range of convergence 
is wider. Therefore we expect an optimal value for Brr' to exist , which 
is large enough to satisfy (22) and to ensure a wide range of convergence 
but at the same time the summations over I converge rapidly allowing us 
to use a reasonably low Imax value in the actual calculations. The choice 
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^RR' = |R' — R| proposed by Goiiis et al. [33] is suitable for cubic systems 
[17,18]. However, for a crystal with lower symmetry it may happen that the 
convergency ranges for different neighbours do not even overlap with this 
choice of bRRi (e.g. for a tetragonal lattice with large c/a ratio). We have 
solved this problem by choosing the displacement vector equal to the radius 
of the circumscribed sphere 

On the basis of a simple model [34], described in Appendix B, we have 
shown that the displacement vector is related to the circumscribed spheres 
radii as 



+ |R'-R| = (l + a)(5^ + 5^,), (24) 

where a is a small positive number which determines the ratio between the 
maximal orbital quantum numbers included in the outer and inner summa- 
tions, i. e. 

^max/^inax — (25) 

In order to ensure similar accuracy for different neighbours we have to choose 
a = const. We can sec that the limit where a tends to zero corresponds to 
the lower limit of 6 _r_r/ + ]R' — R] in the inequality (22), but in this case the 
inner summation should go to infinity. For finite bRRi + |R' — R| has to 
be chosen according to (24) above its lower limit to assure the convergency 
of the inner summation for any 1. With decreasing a, the number of terms 
in the outer sum decreases. On the other hand, with decreasing a, we need 
more and more multipole moments in the calculation of the inner sum. By 
choosing the coordinate system with 2 axis pointing in the direction of the 
neighboring atom R' the summations in (20) can be evaluated efficiently even 
for very high f^ax values. Therefore, it is preferable to choose relatively high 
a values in order to minimize the computer time needed for the calculation 
of the multipole moments. 

In order to test the accuracy of the intcrccll energy term evaluated by Eq. 
(20) and using for the displacement vectors the recipe given in (24) and (25) 
we calculated the Coulomb energy of a homogeneous charge distribution of 
several lattices with different symmetry. In these calculations we took a = 0.5 
and Zmax = 20 — 22. The scaled Coulomb energies — i.c., 

the average Madelung constants are plotted in Fig. 3 as a function of /max for 
cubic, tetragonal, a — Np and a — Pu structures. The exact results obtained 
by the Ewald technique are also indicated in the figure. For the intracell 
Coulomb energy given in (17) and for the intcrccll Coulomb energy for cells 
with nonoverlapping bounding spheres (23) the converged values were used 
for any /max. 

We can see from the figure that the results converge to the exact values 
smoothly in each case, indicating that using (24) the sums over / converge 
simultaneously for different neighbouring cells. The relative deviations from 
the exact values are less than 0.03% for /max = 20 in each case. 
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Fig. 3. The scaled average Coulomb energy per cell of a homogeneous charge dis- 
tribution (Madelung constant) of (a) simple, face-centered and body-centered cubic 
and (b) tetragonal, a — Np {oP8) and a — Pu (mP16) structures as a function of 
^max- For the tetragonal lattice c/a = 1.5 was used. The exact values for the Made- 
lung constants obtained by the Ewald procedure are indicated by dashed lines. 



3 Construction of the Charge Density 

It was shown by Andersen et al. [9,35] that even for open structures such as 
the diamond structure one may obtain good charge densities by means of an 
LMTO-ASA potential. In their approach, however, the output charge density 
is given in a multi-centre form which requires double lattice summations and 
is therefore less suitable in total energy calculations. Our aim is to rewrite 
the output ASA charge density in a one-centre form (5). This expression is 
simple to evaluate and well suited for the integration in the Wigner-Seitz cell 
at R. 

The muffin-tin (MT) orbitals for the low I orbital quantum numbers are 
defined as [35] 



= YL(rfl) 



PRi{e-,rR) + PRi{e)ji{K,rR) for < Sr 
ni{K,7-R) iOTTR>SR, 



(26) 



where ^Ri{c,rR) is solution of the radial Schrodingcr equation, = e — vq 
is the ’’kinetic energy in the interstitial region”, ji and ni are, respectively, 
regular and irregular solutions at the origin of the radial wave equation for 
the constant potential vq (the muffin-tin zero), and Pri(c) is the potential 
function. In a standard self-consistent calculation only the s,p, d(and /) par- 
tial waves are included in the basis set, therefore in (26) I < Zmax = 2(or 
3). 
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The trial wave function for the energy e is set up as the linear combination 
of the MT orbitals 



V’(e, J") = ^ XRL{e, yr)url- (27) 

RL 

The expansion coefficients u^rr as well as the energies ej are determined 
from the condition that the wave function tp{e, r) should be a solution of the 
Schrodinger equation for the muffin-tin potential 

‘I’'* (28) 

[Vo li VR> Sr 

In order to set up algebraic formulation of this condition we expand the tails 
of the riR{K,rR) = ni{K,rR)YL{vR) envelope functions arround the site R' in 
terms oiihe jli^k^tri) = ji'{K,rRi)YRi{vRi) functions, i.e. 



nL{K,rR) = - '^jL'(K,rRi) Sr'L',rl{r)- 
L' 



(29) 



Here the expansion coefficients are the well known structure contants. It is 
very important to note that for each I the I' summation in (29) goes to infi- 
nity, and, because at high orbital quantum numbers the centrifugal potential 
becomes dominant, all the high I' terms from the right hand side of (29) arc 
solutions of the Schrodinger equation. 

For the energies ej where nontrivial solution of the secular or tail cancel- 
lation equation exists the multi-centre form of the wave function (27) inside 
the MT sphere at R' reduces to the one-centre form 



if r/?/ <Sw. (30) 

L' 

This is the expression that is used during the self-consistent iterations. In (30) 
two significant approximations have been made: i) the use of the incorrect 
basis functions in the overlap region, and ii) the neglect of the high V terms 
in the expansion (29). 

In the case of overlapping spheres, there is, in the region of overlap, an 
uncancelled remainder which is the superposition of the functions 



/fii(e,rfl) = XRL{e,YR) - nL{n,rR) = /ra(e> (31) 



coming from the neighbouring sites [35]. Here, 
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fm{e,rR) = ifim{e,rR) + Pm{e)ji{e,rR) - ni{e,rR), (32) 

and OrItr) is a step function, which is 1 inside and zero outside the MT 
sphere at R. Thus, in the case of overlapping spheres centered at R the wave 
function at R’, i.e. tri < Sri, should be corrected as [35] 



tp{ej,rRi) + EE JrL' i^j !^r)u^rri ■ (33) 

L' R L' 

In Ref. [20] we describe a method whereby the / function can be included 
in the one-centre form of the charge density. In many systems we found that 
this correction is negligible compared to the effect of the high I' terms from 
the expansion (29). Therefore, here we focus on these terms and present a 
technique how the high tail components can be taken into account when the 
one-centre charge density is constructed. 

Including the high I' terms in (29) the approximate wave function can be 
written in the following simple one-centre form 

V'(ej,rfl') = Y^VR'L'{(-j,rR,)u’j^,j^, - ^ :jL'{Kj,rR,)w>^,^,, (34) 

L' L' 



where the second summation includes only terms with I' > This expres- 
sion is valid inside and outside of the MT spheres as well. When tri > Sri 
the partial waves ipRui has to be substituted by the proper free electron so- 
lution nil — PR'iijii ■ In (34) we have extended the definition of the expansion 
coefficients for the high I' quantum numbers, i.e., 



> ^max and I < /max- (35) 

RL 

In this expression the off-diagonal {V > 1) structure constants are involved. In 
the conventional, unscreened, MTO method the calculation of these matrix 
elements is obvious. In the case of tight-binding representation the high- 
low subblock of the structure constants is constructed using the so called 
’’blowing-up” technique [7]. The screening parameters for the high I indices 
are set to zero, and therefore the internal I” summation in the Dyson equation 
[7], written for the high-low subblock, 



5 0c 

R'L',RL 



(k) = S 



0 

R'L',RL 



{k)+ 



0 

R'L',R"L" 



(k)o!Rii L" Srii RII rr{h) 



R"L" 



(36) 
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is truncated at = ^max- Therefore, having the low- low snbblock of the 
screened structure constant and using the high-low unscreened matrix ele- 
ments, we can determine by simple matrix multiplication the high- 

low block of the screened structure constants. 

The one-centre form of the charge density can be obtained from Eq. (34) 
and (35). Using a compact notation for the radial functions, i.e. 



<pRi{<^,rR) = 



VRi{(-,rR) if I < 

^max 

-jiif^^rR) if I > 

^maxi 



(37) 



we obtain the following expression for the partial components of the full 
cliarge density 



nRLivR) = URL'<Pm'{ej,rR), (38) 

L"L' j 

where Cr,,r, are the real Gaunt numbers. In (38) both I” and I' summations 
includes all the terms up to > Imax- Similar expressions valid for the 

LAITO basis set are given in Appendix C. In practical applications we found 
that for an accurate representation of the non-spherical charge density 
should be about 8—12 depending on the structure. 

When both the overlap correction (33) and the high I' tails (34) are inclu- 
ded in the charge density besides the terms from (38) or (64) and those from 
Ref. [20] one has to include the cross terms, fRL"jL' and jL" /rl' as well. 

In Fig. 4 we present charge density contour plots for hexagonal graphite 
calculated at the theoretical equilibrium volume using Eq. (64) given in Ap- 
pendix C. From a comparison of the present plots withthose calculated using a 
full-potential method [36] , we can conclude that the effect of the non-spherical 
potential terms, neglected in the present self-consistent LMTO calculation, 
have minor effects on the valence charge distribution. The ASA potential de- 
scribes very well the strongly covalent double humped character of the C-C 
bonds in the hexagonal graphite. Futher examples of the application of the 
above technique for the calculation of charge densities are given in Refs. [20, 
37]. 

Finally we note that, because the high I' tails and the shape of the cell 
are neglected during the self-consistent iterations, the total charge density 
given by (5) and (38) will not be exactly normalized within the Wigner- 
Seitz cell. This difficulty can be overcome by a simple spherical symmetric 
renormalization of the charge density within the cells. 

4 Shape Function Techniqne 

In order to calculate the different energy terms discussed earlier, we need 
a technique to integrate over the W-S cell. To carry out this integration 
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Fig. 4. Charge density contours (in electron/a.u.®) of the hexagonal graphite within 
an atomic layer (left) and in a plane perpendicular to the layers (right). The density 
was calculated by the TB-LMTO using the one enter form (64). 

we use the so-called shape function technique [2,17,32] which was recently 
implemented by Drittler et al. [38] in their development of a full-potential 
Korringa-Kohn-Rostoker multiple scattering method. An alternative scheme 
to treat interstitial integrals has been proposed by Savrasov and Savrasov [15] . 
Here, we apply the original version of the method and define the following 
shape function 
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1 for yr e fiR 
0 otherwise 



which can be expanded in terms of real harmonics 



(39) 



o-fl(rfl) = ^o-RL(rfi)YL(rfl). (40) 

L 

Here ctrl^yr) are the partial components of the shape function. By means of 
the shape function any integral over the W-S cell may be transformed into 
an integral over the sphere which circumscribes the cell, i.e. 



/ nR{YR) K[n] dvR = / o-fl(rfl) rifi(rfl) K[n] dvR. (41) 
Js2r •'^r 

Here K[n\ can be an arbitrary functional of electron density, for example, the 
kinetic and exchange-correlation energy density from Eqns. (10,13,14), or the 
Coulomb potential from Eq. (16), etc. The quantity <jr{yr) URijCR) can be 
expanded in terms of real harmonics 



(tr{vb) nR{vR) = '^TiRL(;rR)YL{rR), (42) 

L 

where the partial radial functions represent the Yl{yr) projections of the 
charge density on a spherical surface that lies inside the Wigner-Seitz cell. 
These functions are used in Section 11 in the evaluation of the Coulomb 
energy. In terms of the partial components of the shape function and of the 
charge density they can be expressed as 

nRL{rR)= ^ Cl,j^nnRL'{rR)aRL'^{rR), (43) 



where are the real Gaunt coefficients. 

4.1 Evaluation of the Shape Functions 

In order to determine the partial components of the shape function (39) 



RRL{rR) 



RR{TCR)YL{YR)dYR, 



(44) 
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we divide the polyhedron, generated by the well-known Wigner-Seitz ap- 
proach, into Nt tetrahedra, and we choose a local coordinate system for each 
tetrahedron. Thus the two dimensional surface integral in (44) may be per- 
formed only for the non-equivalent tetrahedra of number N„, and the total 
shape function is obtained as 

7V„ iVc(t) 

cTRLirn) = ( 45 ) 

t i m' 

where D\^ arc the matrix elements of finite rotations defined, for example, 
in Ref. [39], and ai, /3i,^i are the Euler angles of the local coordinate system 
associated with the tetrahedron i. In (45) Ng(t) denotes the total number of 
tetrahedra of type t. i.e. iVe(t) = Nt- 

The angular integration for each non-equivalent tetrahedron was perfor- 
med by integrating analytically over 9 and numerically over (f>, as described 
by Stefanou et al. [40]. In this way we managed to achieve both high accu- 
racy and efficiency, and thus develop a general algorithm for determining the 
shape function from the neighboring lattice vectors for an arbitrary structure. 

In Fig. 5 we show the L = (0, 0), (4, 0), (4, 4), (10, 0) and (14,0) partial 
components of the shape function for the hcc structure as a function of r in 
units of the lattice constant a. Apart from the spherical component ct(o,o)(^)i 
all the other terms are zero inside the inscribed sphere and outside the circum- 
scribed sphere. The partial components of the shape function and their deri- 
vatives have kinks at the points where the sphere of radius r passes through 
a face, an edge or a corner of the polyhedron. In order to ensure the required 
accuracy in the radial numerical integration we take these points as mesh 
points. 

The partial wave expansion of the shape function (40) oscillates strongly 
and its convergence towards the step function is rather slow. The insert in 
Fig. 5 shows the relative error of the volume between the Wigner-Seitz cell 
and the inscribed sphere, calculated as 

= / '^(r) (j(r) dr - a{r) a{r) dr 

Js^ Js^ 

= E/. dr = J2di- (46) 

L I 

As it can be seen from the figure the relative error for Imax = 30 is still 
very high (4.2%). However, the partial components exhibit several oscillations 
within the interval r G (S'*, S'*) and the number of oscilations increases with 
the orbital quantum numbers. Therefore, the quantities derived from the 
shape funetion by integration like (41) are well behaved, as it can be seen 
from Fig. 2, where the Hartree energy of fee Cu is shown as a function of Imax 
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Fig. 5. The partial components of the bcc shape function for L = 
(0, 0), (4, 0), (4, 4), (10, 0) and (14,0) as functions of the radius. In the insert the 
relative error (F^-s* ^ (®cc Eq. (46)) is shown for different Imax 

values. 



used in (17) or equivalalently in (43). The figure illustrates that a resonable 
accuracy is achieved already for Imax = 8 — 12. 



5 Discussion 

In this section we review and discuss through several examples some impor- 
tant calculational details related to the FCD total energy calculations. In the 
self-consistent procedure we solve the scalar-relativistic Dirac equation using 
either the hamiltonian or the Green’s function formalism. In the former de- 
scription the so called combined correction term [3,5,9] is included. The core 
electrons are treated within the frozen core approximation, and the semi-core 
states are included in a second energy panel. In this panel all the I chanels, 
except that corresponding to the semi-core states, are downfolded [7,8]. In the 
upper energy panel the inactive chanels are dow-folded. This procedure ac- 
counts correctly for the important weak hybrization in the occupied parts of 
the bandstructurc and reduces the rank of the eigenvalue problem to that of 
the number of active orbitals. Moreover, in this way we manage to avoid the 
numerical difficulties that arises by fixing the expansion parameter Ci, m in a 
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different position than the center of the occupied part of the corresponding 
chanel. 




Fig. 6. Equilibrium Wigner-Seitz radii of the light actinides obtained by the full 
charge density method. The calculations are performed in the LDA or the GGA 
for the crystallographic a-phases indicated in Pearson notation at the top of the 
figure. The measured room temperature values are corrected to T = OK using the 
measured mean thermal expansion coefficients. 



As an illustration of this problem we consider the light actinides in the 
low temperature a phases. The uncorrected ASA fails [37] in the case of these 
open systems and, therefore, a more accurate and, at the same time, efficient 
approach is needed. In Fig. 6 we show the equilibrium Wigner-Seitz radii for 
the light actinides calculated by the present method using either LDA [41,42] 
or GGA [29] exchange-correlation energy functionals. The crystallographic 
a phases are indicated in the top of the hgure. To obtain fully converged 
binding energy curves with a proper minimum for the light actinide metals 
in the FCD method the 6p states must be included in the hrst energy panel 
and the 7p states in a second energy panel. In the hrst panel we down- 
folded the s, d and / states and in the second only the p states. For all the 
active and inactive orbitals the Cu was hxed at the center of the occupied 
part of the corresponding band. We have found that the GGA results are 
on average only 1.3% larger than the zero temperature experimental values. 
This slight overestimate of the equilibrium volume is a common feature of 
the GGA functional observed in the case of the transition metals as well [22] . 
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We note that the agrement between the LDA results and the experimental 
values is very good at the beginning of the series but the experimental trend, 
reproduced by the GGA, is not properly described by the LDA functional. 
In order to see the effect of the inactive 7p band we fixed e^, jp at the center 
of the 7p band, thereby reducing the theoretical LDA atomic radius for Th 
by 2%. Throughout the actinide series the 7p band ascends and the errors 
related to fixing e^, become smaller. Hence, an incorrect paramctrization of 
the LMTO basis can alter the theoretical trend provided by the FCD method. 

The number of orbitals or basis functions that should be considered in the 
solution of Schrddinger’s equation for a crystal potential is one of the basic 
problems in any band structure calculation method. A low cutoff in the orbi- 
tal quantum number, i.e., a low Imax in (26), prefered in the secular equation, 
does not always quarantee basis set convergency. Regarding the present me- 
thod, in several applications we found that the total energy evaluated form 
the total charge density represents a better basis-set convergence compared 
to the ASA total energy. In other words, in the case of the transition metals, 
for example, the inclusion of the / orbitals has a relatively small effect on the 
FCD total energy, while the ASA total energy can be affected by about 40% 
[17,18]. This is illustrated in Fig. 7, where we plotted the surface energies of 
the Ad transition metals obtained for the close packed fee (111) surface by 
means of the LMTO- ASA and FCD methods. The two sets of results corre- 
spond to s,p, d and s,p, d, f basis sets. The isotropic experimental values are 
taken from Ref. [43]. The ASA siirfaee energies and the surface full charge 
densities, needed for the FCD surface energies, were calculated by means of 
the self-consistent surface Green’s function technique implemented by Skriver 
and Rosengaard [44]. Details of this calculation can be found in Refs. [17,23]. 
As it can be seen from the hgurc, the two sets of FCD results are very close 
to each other, and in fact they follow the experimental trend. In the ASA 
calculation the inclusion of the / orbitals has a strong negative effect in the 
total energy of surface layers. This is due to the fact that in a conventional 
ASA method the tail expansion (29) is truncated at the same Imax as is used 
for the muffin-tin orbitals. In the FCD technique the truncation in Eq. (29) 
is always higher than the one in Eq. (26). 

The kinetic and exchange-correlation energy functionals used within the 
FCD technique involve the functional derivatives of the energy densities, 
see Eq. (9). The first order functional derivative of the exchange-correlation 
energy, for example, is the well known exchange-correlation potential, pxc, 
etc. Some of the parametrized density functionals, however, have different 
analytical forms for different densities or density gradients, which matches 
continuously. As a result, they may display an artificial discontinuity in the 
second or higher derivatives with respect to density or gradient and cannot 
be applied in conjunction with (9) and (10) which requires simple analytic 
representations of the exchange-eorrelation and kinetic energies. Therefore, 
in the actual applications the exchange-correlation energy is evaluated using 
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Fig. 7. Comparison of the surface energies obtained using the full charge density 
LMTO and the LMTO-ASA methods for the fee (111) surface of the 4d element. 
The dashed lines connect the result obtained on the s,p,d basis set, and the solid 
lines those obtained by inclusion of the / orbitals. The experimental results by de 
Boer et al. are also shown. 



either the LDA functional by Perdew and Wang [45], or the GGA functional 
by Perdew et. al [29,46], while the kinetic energy correetion, Eqns. (10-13), is 
evaluated using either the gradient expansion (47) or the Pade’s form (52). 

In the PCD technique Poisson’s equation is solved exactly and, hence, the 
Coulomb energy is calculated exactly from the total, non-spherical charge 
density. Moreover, within the LDA or GGA the exchange-eorrelation energy is 
also evaluated exactly, and the kinetic energy is corrected for the nonspherical 
effects. However, the ASA part of the spherical kinetic energy contribution 
(11) reflects the accuracy of the ASA potential, e.g., how well the spherical 
part of the full-potential is described by the ASA potential inside the atomic 
sphere. In open structures with large overlap between the atomic spheres the 
ASA does not work well and one has to introduce empty spheres into the 
structure to reduce the overlap and maintain the accuracy of the ASA. As an 
example of such a system we consider hexagonal graphite, where we include 
8 empty spheres in the unit cell in addition to the 4 C spheres. 

The contour plot of the graphite charge density is shown in Fig. 4. Inclu- 
ding 8 empty spheres the true charge density, obtained by the full potential 
method [36], is equally well reproduced in the C-C direction, in the centre 
of the hexagonal arrangement of the C atoms and between the atomic layers 
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c/a 

Fig. 8. The total energy of hexagonal graphite as a function of the c/a ratio obtai- 
ned by means of the full charge density LMTO method. 



as well. The equilibrium Wigncr-Scitz; radius and bulk modulus obtained by 
the present ealculation are 1.601 a.u. and 3.32 Mbar within the LDA, and 
1.618 a.u. and 3.07 Mbar within the GGA, respectively. These numbers are 
only in moderate agreement with the experimental values, 1.67 a.u. and 2.86 
Mbar [36], and with the full potential results from Ref. [47], 1.68 a.u. and 
2.36 Mbar. However, when the c/a ratio is considered, our LDA value shown 
in Fig. 8 is in very good agreement with both the experimental, 2.72 and full 
potential value, 2.77 from Ref. [36]. The c/a ratio in the present work was 
calculated by minimizing the total energy with respect to the c/a at constant 
volume, like in Ref. [36], and, therefore, it reflects only partially the C-C 
interlayer bonding. 



6 Conclusions 

We have presented a full charge density technique based on the complete 
charge density from a self-consistent LMTO calculation employing a spheri- 
cally symmetric ASA potential. In the calculations, besides the exact Cou- 
lomb and excha- 
nge-correlation terms within LDA or GGA, we include a correction to the 
ASA kinetic energy, which means that we evaluate the true functional rat- 
her than an ASA functional. The technique has been tested through several 
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calculations for systems where the conventional ASA method fails [18,20,22, 
23,37]. The comparison with the experimental values and with full poten- 
tial results shows that the FCD technique has an accuracy similar to that 
of a full potential description, while the required computational effort is not 
signihcantly larger than in conventional spherically symmetric LMTO-ASA 
calculations. 
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8 Appendix A 

The starting point for the kinetic energy correction is the density-gradient 
expansion of the nonintcracting kinetic energy functional [28] 



T[n] = r(°)[n] -t r(2)[n] -f T(^>[n] + ... (47) 



with 



rj^(2k) 






(48) 



Here, is a kinetic energy density which ( in atomic Ry units ) has the 
following explicit forms 



tiO) = f(37r2)2/3„5/3_ 

5 



(49) 



^(2) ^ 1 (V^)^ 



36 n ’ 



(50) 






370(37t2)2/3 



) 2 / \ / \2 / \ 4 ' 

9 / W Vn \ 1 / Vn \ 

8y n y ^3in j 



(51) 



for k = 0,1,2. This expansion is valid for slowly varying or high densities, 
i.e. when s = |Vn|/n^/® 1. For high values of s the expansion diverges 
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and, therefore, other approximate functional forms should be considered. In 
the locally truncated expansion suggested in Ref. [48] the number of terms 
included in the series is determined in each point in space by a local criterion 
based on the properties of an asymptotic series. In practice, this means that 
in regions of high gradients the functional reduces to the Thomas-Fermi form, 
which, obviously, represents a poor approximation to the real kinetic energy 
density. In Ref. [49] wc showed that the local inclusion of the von Weizsackcr 
term, t\v = rather than the Thomas-Fermi term, gives resonable accu- 

rate results in the large gradient limit as well. The parametrized form of this 
kinetic energy functional is 



is, 2 



tW 



1 + 0.95.T + 9ffl.x'3 
1 — 0.05x -I- ax^ 



(52) 



where x = (5/27) s^. The parameter a was determined using the exact Kohn- 
Sham kinetic energies for the jellium surface [50], and we found a = 0.396 
[49]. 



9 Appendix B 

To find a reasonable choice for the displacement vector introduced in Eq. 
(20) wc have calculated the electrostatic interaction energy of two truncated 
spheres of radius 5"^ with a uniform charge distribution no, separated by a 
distance d [34]. In the case of overlapping bounding spheres, d < 25"^, the 
intercell energy (20) is given by 



{i + i' + ry. 1 

l\l'U"\ (6d + d)''+'"+i^' 



(53) 



For nonoverlapping spheres, d > 2S‘^, this expression reduces to the form 






(54) 



1 , 1 ' 



where Qi stands for the l-th multipole moment 



Qi 



27tTl() . ^c\Z+3 

1 + 3^^ ^ 



r 2 s<' 2nnn f d \ 

^ (-) P,(x)<lx. (55) 

' J- '' O OC \ / 
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Here P;(a;)’s are the Legendre polynomials. We mention that owing to the 
axial symmetry of the system the multiple moments vanish for m / 0. 

The necessary condition for the convergency of the outer sum in (53) is 
the convergency of the inner sums over I' and I" for each value of 1. The 
diagonal terms of the inner sum in (53): 



Ai'i' 



= Qv 



{l + 2l')\ 1 

1!(1'!)2 (6^ + d)2i'+i 



have a maximum around V = I for 1,1' 1 



(56) 



l{l) = I 



Qi+1 

Qi 



ba + d- 2%^ 



(57) 



From (54) we can estimate the upper limit for the ratio of the multipole 
moments since this sum is always convergent for d > 2S"' (similar result 
can be obtained from (55)). Therefore for higher I values we have 



Qi+i 

Qi 



< S" 



(58) 



and for I wc obtain 



l{l) < I 



S'" 



bd + d- 2S" 






(59) 



Thus we see that the individual terms in the inner sum show a maximum, 
which (strictly speaking its upper bound) is proportional to I and the coeffi- 
cient a depends on bd- In order to ensure the required accuracy of the inner 
sum it is reasonable to assume that the summation should be carried out at 
least up to I'max — ^ 2l[l) for any 1: it is easy to show that in this case 

for the neglected terms in the inner summmations <1%. This should 
hold for the largest value of I as well, i.e. Imax = o^^maxi we obtain the 
relation 



bd + d={l + a)2S". (60) 

The validity of this choise of the displacement vector should hold for realistic 
systems as well because in (58) for the ratio of the multipolc moments wc use 
an estimate which is independent on the shape of the cell and on the charge 
distribution. We note that Eq. (24) is a generalization of (60). 
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On the basis of (60) we can explain the existence of the range of con- 
vergence in the outer summation. For a fixed value of I'^ax it is obvious 
that with increasing I, we start to neglect significant terms in the inner sum- 
mation if -Z > which may lead to the divergence of the outer sum. 

Therefore an upper limit of the range of convergence in I may be defined as 
^'max {^20^ ~ i)' Thus with increasing bd the range of convergence becomes 
wider in accordance with the observation by Gonis ef al. ([33]). 

10 Appendix C 

In the linear muffin-tin orbitals method [1,2, 3, 4, 5, 6 , 7, 8 , 9] the lower part of the 
tail functions are substituted (taking into account the normalization function) 
by the first energy derivative of the partial wave, ip'^ calculated at the center 
of the energy range of interest, In this way the LMTO’s become energy 
independent up to the second order and the summation over the occupied 
states in (38) can be evaluated separately. The energy dependence of the 
envelope functions within the LMTO formalism is fixed at = 0. In the 
following we use the nearly orthogonal representation ( 7 ) but the relations 
can easily be generalized to other representations. 

The second order expression for the wave function (34) that includes the 
high tail contributions becomes 



L' 

+ Y^3r'L'{^R')~^R'L'^ ( 61 ) 

L' 

where (p denotes the second order energy derivative. The first summation 
includes terms up to the Imax < 2(or 3), while the second summation includes 
all the high I' tail components as well. In (61) we have introduced a compact 
notation for the tail functions 



jR'vi.fR') 



and for the eigenvectors 



[^h’vi'rR’) lil' < ^max ) 
\ji'{rR') ifl'> ^max 



U 



3 

R'L' 



- 



■j 

J2rL ‘‘^R'L',RL 






‘■RL 



if / ^ ^max 1 
if / 7 Imax 



(62) 



(63) 



In (61) the partial waves, their energy derivatives and the energy derivative 
of the potential function are calculated at e = e^. Now we can write the 
expression for the partial components of the full charge density, 
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n-RL — C^nj^ilipRl'iipRl'mPfj^j^,,]^, + ^RV'pRli'm^Rj^nl^, 

L"L' 

+ 2‘^Rl"‘fRl’^RL"L' + ^pRl"^Rl'ij^\iL’L")* 

+ 2‘^Rl''y^Rl'{RT''RL'L")* + jRl''jRl'^\L"L']> ( 64 ) 

where for simplicity we have neglected the radial variable vr, and 



RL"'^itL' in",!' < Imax, 

3 

occ. 

mRL"L' = U^RL"UrL' if < ^niax, 

3 

occ. 

’^RL"L' ^ ^ RL"i^j ~ Rl') U^RL' if ^ 3 3: Imax, 

3 

occ. 

^^RL"U = y~! ^RL"^RL' 

3 

In (64) the I" and I' summations include the high tail components as well, 
however, for these high indices the tp and ip should be set to zero. 

The high-low block of the structure constant 5"*' can be calculated using 
the Dyson equation 

^R'L',RL = ^R'L',RL + ^ ^R'L' ,R"L"{lR"l" ~ ^R"l")^R"L" ,RL^ (65) 

R"L" 

where 7 _r"/" = aRmn = 0 for I" > Imax, and the high-low subblock of the 
tight-binding LMTO structure constant 5“ is obtained from (36) written for 

R? = 0 . 
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Abstract. The paper presents a recently developed full-potential linear muffin-tin or- 
bital (FP-LMTO) method which does not require empty spheres and can calculate 
the forces accurately. Similar to previous approaches, this method uses numerical in- 
tegration to calculate the matrix elements for the interstitial potential, which is the 
limiting step for any FP-LMTO approach. However, in order to reduce the numerical 
effort as far as possible, we use a newly introduced basis consisting of “augmented 
smooth Hankel functions” which play the role of the LMTO envelope functions. After 
presenting the basics of the approach, we report the results of numerical test for typical 
condensed-matter systems. The calculations show that good accuracy can be reached 
with an almost minimal basis set. These features of the method open the way to ef- 
ficient molecular dynamics studies and simulated-annealing calculations to optimize 
structures while retaining the advantages of the LMTO method. 



1 Introduction 

The linear muffin-tin orbital (LMTO) method [1] has played a very successful 
role among the various techniques for solving the density-functional equations [2] 
for a condensed-matter system. Two characteristic features of this approach are 
(i) the use of atom-centered basis functions of well-defined angular momentum, 
constructed out of Hankel functions, and (ii) the use of augmentation to in- 
troduce atomic detail into the basis functions in the vicinity of each nucleus. 
Overall, the rationale behind this approach is to construct basis functions which 
closely resemble the actual wavefunctions from the very beginning. The conse- 
quence is that a comparatively small basis set already leads to a converged total 
energy, hopefully giving rise to substantial reductions in the computation time 
and the storage requirements. 

In general term, when using a sophisticated basis in this manner, it is not 
a priori clear that these savings will actually be the realized. The price for a 
more sophisticated basis set is an increased computational effort in some steps 
of the calculation. This extra effort may or may not cancel out the gain due to 
the reduced basis size. For the LMTO method, the balance is no doubt positive 
if the atomic-sphere approximation (ASA) [3] is used. Hereby the single-electron 
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potential is modeled by a superposition of spherical potentials inside overlap- 
ping space-filling spheres. Where this approximation is applicable, the LMTO- 
ASA method is presumably the most efficient procedure available for solving the 
density-functional equations to a reasonably high degree of accuracy. However, 
a “full-potential” treatment which goes beyond the ASA is needed for many 
systems of interest. Typical examples are the total energy changes associated 
with phonons distortions and atomic relaxations, say at a surface or around an 
impurity. Furthermore, for low-symmetry situations the LMTO-ASA method 
becomes unwieldy as the question of “empty spheres” to improve the packing 
fraction becomes more important. Finally, since the energies associated with such 
distortions are not reliable, the question of the calculation of the forces on the 
atoms does not even arise. However, the forces are precondition for simulated 
annealing and molecular dynamics studies in the spirit of the Car-Parrinello [4] 
method. 

A number of different approaches have been developed in the past to go 
beyond the ASA in the LMTO method. Their common element is that the po- 
tential is treated correctly within the existing LMTO basis set. Unfortunately, 
this has lead to a substantial increase in the computational effort. In the inte- 
rest of efficiency, we have explored and implemented a novel approach in which 
the LMTO basis functions themselves are modified in a controlled manner. This 
has two benefits: the basis set can be even smaller, and the effort for a nume- 
rical integration of the potential matrix elements is reduced. At the same time, 
a reformulation of the augmentation procedure is required since the standard 
structure-constant expansion cannot be used for the modified functions. Tur- 
ning this into an advantage, we have used a “projection” description which is 
partly based on previous developments due to Blochl [5] and Vanderbilt [6] but 
which includes some new features. Among other benefits, the separation into 
a “smooth” and atomic “local” terms is cleaner, leading to a straightforward 
expression for the forces. 

Overall, the following criteria were considered imperative when developing 
this variant of the LMTO method: 

• The forces must be calculated. 

• All reasonable geometries should be handled without the need for empty 
spheres. 

• It must be possible to improve the accuracy systematically by turning up 
the various convergence parameters. 

Within these requirements, it was tried as far as possible to maintain the cha- 
racteristic advantages of computational efficiency and small memory demands. 

In the rest of this paper, the modified LMTO envelope functions are discussed 
in Sect. 2.2. The reformulated augmentation procedure and the force theorem 
are described in Section 2.3. Finally, we present some practical experience with 
the resulting method in Section 3. Specifically, we investigate the dependence of 
the results on the various convergence parameters and compare to calculations 
with other methods for a number of realistic systems. 
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2 Description of the Method 

This section presents the basic ideas behind the method. After a discussion the 
central role of the interstitial potential matrix elements, the modified LMTO- 
like envelope functions (smooth Hankel functions) are presented. Finally, the 
reformulated augmentation procedure and the forces are discussed. 

2.1 The Central Role of the Interstitial Potential Integrals 

In a density-functional calculation, the three main tasks are (i) to solve the 
Poisson equation and to add on the exchange-correlation potential, making the 
effective potential felt by an electron, (ii) to solve the single-particle Schrodinger 
equation, and (iii) to add together the squared moduli of the wavefunctions to 
accumulate the new output density. In most cases, the solution of Schrodinger’s 
equation is the most difficult and the most expensive step of these. In fact, if 
the potential is to be treated without any further approximation (ie., in a “full- 
potential” method), the main bottlenecks usually turn out to be in two closely 
related substeps: 

• To set up the Hamiltonian matrix, we require the matrix elements of the 
effective potential V),ff('^) for the basis functions Xii^)- 

Vij = J Xi{r)VeH{r)xj{r) dr . (1) 

• To obtain the output density, we must sum over the squared moduli of the 
wave functions ^„(r): 



n-‘(r) = (2) 

nocc 

Since the wavefunction is a linear combination of the basis functions, the real 
task when making the output density is to express the product of any two basis 
functions XiXj in n form suitable for further handling. Specifically, we must be 
able to evaluate the integral of this product times the potential, since this is 
one important contribution in the total energy. Furthermore, the representation 
must “close the loop” so that the output density can be mixed with the input 
density and the result fed into the next iteration. In practice, both of the steps 
above arc essentially the same problem, namely the evaluation of the potential 
integrals of (1). 

In augmentation approaches such as the LMTO and linear augmented plane- 
wave (LAPW) methods [1], space is partitioned into atom-centered mufhn-tin 
spheres and an interstitial region. The potential integrals split up accordingly 
into the integrals over the two types of region. The contributions from the muffin- 
tin spheres can be calculates in a straightforward and reasonably efficient manner 
since the potential as well as the basis functions can be expressed using one- 
center expansions. That is, each function is written in polar coordinates around 
the relevant sphere center as a radial part times a spherical harmonic. 
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Thus, the term which requires most attention is in fact only the interstitial 
contribution to the potential matrix element: 

= I H*ir)V{r)H,{r)dr . (3) 

JlR 

Here the functions Hi[r) denote the envelope functions, z.e., the analytic func- 
tions which will be augmented inside the atomic spheres to obtain the final basis 
functions Xi(r), and IR denotes the interstitial region. This matrix element is 
always problematic, independent of the representation chosen for the interstitial 
potential. In fact, a major part of devising a viable method is to select a repre- 
sentation for the density and the potential which makes it possible to compute 
these matrix elements reasonably efficiently. The efficiency of the LMTO-ASA 
method is clear: by making the atomic spheres space filling and neglecting the 
interstitial potential, the most demanding computational step is eliminated, lea- 
ving only terms which can be evaluated in a compact and effective manner. 

Basically there are two ways to proceed in a full-potential approach. First, 
the potential in the interstitial region can be expanded in some suitable set of 
auxiliary functions. For example, a set of atom-centered functions can be used, 
which can lead to a very compact representation. Substituting the expansion 
under the integral in (3) leads to a sum of integrals, each over a product of three 
terms. Unfortunately, such integrals can almost never be evaluated in closed 
form (the notable exception is when all terms are gaussians). In our context, 
two of the terms are LMTO envelopes, and no reasonable choice of a potential 
expansion considered to date leads to a closed form. 

Alternatively, the interstitial potential can be specified by tabulating it on a 
regular mesh which extends through the unit cell. Equivalently, the coefficients 
of the Fourier expansion of the potential can be given. Although such a nume- 
rical representation requires more data to specify the potential, there are some 
considerable advantages. First, since the exchange-correlation potential must be 
evaluated point by point, a mesh representation is needed at some stage in any 
case. Second, by using a regular mesh the Poisson equation can be solved easily 
using a fast Fourier transform. Third, the additional step of htting the output 
density or the effective potential to the set of auxiliary functions is avoided. 
Finally, enhancements such as gradient corrections can be implemented more 
easily on a regular mesh than in a more complicated representation. For these 
reasons, a real-space mesh representation of the interstitial potential (as well as 
the interstitial density) will be used in the following. In effect, we have opted for 
a hybrid treatment in which the wavefunctions are represented using a carefully 
constructed atom-centered basis set but the interstitial potential and density are 
given as numerical tabulations. 

At this point, a comparison to different existing methods is in order. The 
generally used procedTire is to extend the basis functions and the interstitial 
potential smoothly through the atomic spheres in some manner, to integrate (3) 
over the complete unit cell using these smooth functions, and finally to subtract 
off the unwanted contributions inside the spheres in conjunction with the aug- 
mentation step. In the full-potential LAPW method, all the involved functions 
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are simply plane waves and the integral over the cell can be written down im- 
mediately. In the various FP-LMTO approaches, the smooth extension must be 
explicitly constructed for the sphere on which the function is the centered. This 
can be done by matching an analytical expression (such as a polynomial) at the 
sphere radius [7,8] or by using analytical functions similar to ours with a suitable 
choice of the parameters [9] . In either case, care must be taken that the resulting 
function strictly equals the standard LMTO envelope in the interstitial region. 
The alternative approach followed in this paper is to “bend over” the envelope 
functions already in the interstitial. As discussed next, this reduces the basis 
size and at the same time permits a coarser mesh for the numerical integration. 
We note that a completely different approach to the interstitial potential matrix 
elements is to re-expand the product of any two envelopes as a sum of auxili- 
ary atom-centered basis functions. In this way the integrals over three factors 
in (3) arc changed to a sum of two-center integrals. The expansion can be ob- 
tained approximately by fitting on the surfaces of the muffin-tin spheres [10] or 
more accurately for molecules by tabulating the results of a careful numerical 
fit [11]. Furthermore, an approach suitable for molecules and polymers has been 
developed which handles integrals over the product of three Hankel functions by 
expanding two of the terms around the site of the third [12]. 

2.2 Smooth Hankel Functions 

In the following, we present the advantages of an LMTO-like basis consisting of 
augmented smoothed Hankel functions. The standard LMTO envelope function 
is a Hankel function of a (usually) zero or negative energy parameter times a 
spherical harmonic. This object will be denoted as a “solid Hankel function” 
in the following. It solves the Schrodinger equation for a flat potential, decays 
exponentially at large distances if the energy parameter is negative, and has a 
singularity at the site where it is centered. The essence of our modification is to 
remove the singularity. The resulting “smooth Hankel function” is smooth and 
analytic in all parts of space. When such a function is used to construct the 
basis, the parameters can (and should) be chosen so that the functions deviate 
from the unsmoothed variants already outside the central atomic sphere. As will 
be explained below, this speeds up the calculation for two separate reasons: the 
basis can be smaller, and numerical integration can be done using a coarser 
mesh. 

Basic Properties 

The smooth Hankel functions (discussed in detail in Refs. [13,14]) are shown in 
Fig. 1 for angular momentum 0, 1, and 2. For large radii, the smooth function 
to each angular momentTim equals the corresponding standard Hankel function, 
showing the same exponential decay proportional to exp(— «:r), as specified by 
the negative energy parameter e = —k^. At smaller radii, the function bends 
over gradually until it finally approaches r* close to r = 0. When multiplied by 
the spherical harmonic YL(f), the result is analytic in all parts of space. 
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Fig. 1. Comparison of smooth and standard Hankel functions for l—O (continuous 
lines), 1=1 (dashed), and 1=2 (dotted lines). The energy e equals —1 and the smoothing 
radius i?am equals 1.0. For large radii, the smooth and standard functions coincide. Near 
the origin, the smooth function bends over gradually until it enters as whereas the 
standard function has a singularity proportional to 1/r*^^. 



Of some importance is the parameter i?sm, denoted as the “smoothing ra- 
dius” associated with the function. For practical purposes, the standard Hankel 
function and its smooth variant are equal where the gaussian exp(— is 
negligible, say for r > 3i?sm When Rsm is increased, the deviation from the 
standard function starts at a larger value of r and the resulting function is more 
strongly smoothed. Specifically, the value at r = 0 for f' = 0 becomes smaller as 
the former singularity is washed out more and more. 

A central distinction to the standard LMTO envelopes is that two separate 
parameters determine the shape of each function. More exactly, the energy para- 
meter determines the exponential decay at large radii, and the smoothing radius 
determines how strongly the function has been smoothed. Consequently, in order 
to tune the function to mimic the true wavefunction as far as possible, both of 
these parameters should be adjusted. 

As a basis set, these functions combine many of the advantages of Hankel 
functions and gaussians. In fact, a smooth Hankel function is a convolution of 
these two types of functions. Due to the exponential decay at large radii, they 
constitute a numerically more stable and more compact basis than pure gaussi- 
ans. In contrast to the standard Hankel functions, they have a smooth nonsin- 
gular shape near the origin. Furthermore, many important quantities (such as 
two-center integrals) can be evaluated analytically. 

Formally, the smooth Hankel functions are defined in the following way. The 
usual Hankel function for angular momentum zero is ho{r) = jr where n 
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defines the decay at large radii. As a function of r = |r| in three-dimensional 
space, ho satisfies the differential equation 

{A + e)ho{r) = -47r^(r) (4) 

where e = —k^ is the energy associated with the function, here always taken to 
be negative. Thus, A + e applied to ho is zero everywhere except at r = 0, where 
a delta function arises from the 1/r singularity of ho. Expressed differently, ho{r) 
is the response of the operator A + e to a delta-function source term. 

To change this standard Hankel function into a “smooth Hankel function” 
the infinitely sharp delta function is smeared out into a gaussian: 

{A + e)ho{r) = -Airgoir) ■ (5) 

By defining a suitable normalization of the modified source term 
goir) = C exp(— / E^^), the smooth Hankel approaches the standard function 
for large r. As r becomes smaller and reaches the range where go(^) is non- 
negligiblc, the function ho{r) now bends over smoothly and behaves as a constant 
times for r ^ 0. 

We will also need smooth Hankel functions for higher angular momenta in 
order to construct basis functions for the s, p, d... states. These can be construc- 
ted by applying the differential operator A’l(-'V) defined as follows. The spheric 
harmonic polynomial 3^(r) = t^Yl is a polynomial in x, y, and z., for example 
C[x^ — j/^). By substituting the partial derivatives —dx, ~dy, and —dz for x, 
y, and z, respectively, the required operator is obtained in a straightforward 
manner. Applying this operator to the delta function yields point dipoles, qua- 
drupoles and so on, and applying it to goir) yields smeared-out gaussian versions 
of these. Thus the L-th smooth Hankel functions is HL{r) = Tl(— V)/ io(x) and 
satisfies the differential equation 

{A + c)Hl = -4nGL{r) = -4^3^i(-V)so(r) • (6) 

A number of important quantities can be calculated analytically for these fun- 
ctions, including the overlap integral and the kinetic energy expectation value 
between any two functions. They can also be expanded around some point in 
the unit cell. For further details, see Ref. [14]. 



Advantages of Smooth-Hankel Envelopes 

The first reason for using the smooth-Hankel basis functions is that this re- 
duces the size of the basis set, leading to a substantial gain in efficiency. To 
make this plausible, note that the standard LMTO basis functions are in fact 
not optimal as a basis for representing the crystal or molecular wave functions. 
The main problem is that they are “too steep” in the interstitial region close to 
the muffin-tin sphere on which they are centered. This is illustrated in Fig. 2. 
The standard Hankel functions solves Schrddinger's equation for a flat poten- 
tial. However, when approaching a nucleus the true crystal potential is not flat 
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Fig. 2. Sketch to explain why smooth Hankel functions lead to an improved basis. For 
the flat potential Vo, the solution of the radial Schrodinger equation if'o is a standard 
Hankel function with a singularity at the origin. As the true potential V starts to feel 
the attractive nuclear potential, the correct wavefunction 'I' bends over. This beha- 
vior already starts outside the muffin-tin radius and is built into the smooth Hankel 
functions. 



but decreases as it feels the attractive nuclear potential. The curvature of the 
wavefunction equals the potential minus the energy which therefore becomes ne- 
gative. In response, the wavefunction bends over and changes from exponential 
to oscillatory behavior. By using smooth Hankel functions, this typical form is 
inherent in each basis function. 

This effect can be appreciated by inspecting the way in which the standard 
LMTO basis functions combine to describe a crystal wavefunction. Generally the 
basis set must include some slowly decaying functions together with others which 
are considerably more localized. In the course of the calculation these combine 
with opposite signs, in this way modeling the required change of curvature. Using 
smooth Hankel functions as envelopes, these already have the correct behavior 
and some of the additional localized functions can be left away. 

In practice, the amount of gain depends on the type of atom. For the impor- 
tant angular momenta, a tripled basis can often be replaced by a doubled set. 
Less important channels such as the d states in an sp atom can be described by 
one radial function instead of two. An overall reduction by a factor of almost 
two is sometimes possible. In the order(A^^) steps, the computation time in such 
a favorable case divides by eight. 




122 



M. Methfessel et al. 



The second major advantage to using smooth Hankel functions instead of the 
standard LMTO envelopes is that the time-consuming matrix elements (3) for 
the interstitial potential can be calculated more efficiently. As described above, 
the integrals can be obtained by integrating over the complete unit cell using a 
regular mesh and subsequently subtracting the eontributions inside the spheres. 
The danger when calculating three-dimensional integrals using a mesh is that 
the computational effort can easily dominate all other steps. To keep the effort 
manageable, it is of high priority to make the integrands as smooth as possi- 
ble. This can be done by using smooth Hankels as envelopes. As an example, 
consider silicon with a muffin-tin radius of 2.2 bohr. For the standard LMTO 
basis, the smoothing must be noticeable inside the MT sphere only, demanding 
a smoothing radius no larger than 0.6 to 0.7 bohr. Outside the central sphere, 
the smooth and conventional Hankel functions are then identical to acceptable 
precision. The required integration mesh spacing is approximately 0.35 bohr. If 
we permit the functions to bend over outside the MT sphere, we find that the 
optimal basis functions have a smoothing radius of about 1.4 bohr. For these 
functions, the integration mesh can be twice as coarse. Consequently the number 
of mesh points and the computational effort are divided by eight. 

At this point it should already be mentioned that in the hnal implementation, 
the matrix elements of the smooth potential are actually calculated in reciprocal 
space. While this is at hrst sight equivalent to a real-space integration, the 
important difference is that a different reciprocal-space cutoff can be used for 
each function. For integrals involving envelopes with a large smoothing radius, 
a small cutoff is adequate. This property is very important for systems such as 
an oxygen impurity in silicon. The O basis functions demand a fine underlying 
real-space mesh through the unit cell, but this is effectively only used for the O 
functions. The effort for integrals between Si functions can be calculated with 
the same effort as without the presence of the oxygen atom. 

Altogether, a modified basis using “smooth Hankel functions” combines two 
major advantages. Since these are more similar to the final wavefunctions, ade- 
quate convergence can be attained using a smaller basis set. Secondly, since each 
function is smoother, the mesh used to evaluate the potential integral can be 
coarser. These effects combine to a substantial saving in computer time. As an 
estimate (admittedly for an extremely favorable case), the integration mesh can 
be twice as coarse, leading to a saving of about (1/2)^ = 1/8 in the order-3 
steps. The basis set will contain something like one-half of the functions needed 
without extra smoothing, giving another factor of approximately 1/8. Together, 
the computer time is divided by 64. While the overall gain will not be as large 
for many systems, a speedup by a factor between 10 to 20 is realistic. 

Analytical Two-Center Integrals 

In the preceding, it was explained how the smooth Hankel functions help to 
evaluate the matrix elements of the interstitial potential more efficiently. In the 
course of a calculation, we also require the matrix elements of the kinetic energy 
operator and the overlap integrals. A major advantage of the smooth Hankel 
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functions is that these integrals can be evaluated analytically. In fact, most two- 
center integrals involving these functions as well as gaussians can be obtained in 
basically the same way. The idea is to use Parseval’s equality: 

I Mr)* f2{r) dr = I fi{q)f2{q)dq ( 7 ) 

and the explicit expression for the Fourier transform of a smooth Hankel function 
H^^r) located at some site R: 

HUq) = ( 8 ) 

6 Q 

where 7 = Rg^/4: is one-fourth of the squared smoothing radius. When two such 
expressions are multiplied together, the result can be readily written as a sum of 
terms of the same basic form, combined with additional powers of arising from 
the Clebsch-Gordon factorization of 3fL(— ig)*TL'(— iq)- Since the phase factor in 
the product is cxp[iq • [Ri — R 2 )], the final result is that the desired integral can 
be expressed analytically as a sum of smooth Hankel functions, evaluated for the 
connecting vector between the two sites. Hereby the extra powers of mean that 
the (only slightly more complicated) functions AHl, A^Hl ■ • ■ are also needed. 
Furthermore, the resulting expression for the two-center integral is equally valid 
for molecular and Bloch-summed functions. In the later case, the only difference 
is that Bloch functions are also substituted in the final analytical expression. In 
addition to the overlap integrals considered here, integrals involving any power of 
the kinetic energy operator — zA as well as Coulomb integrals can be calculated 
analytically in a similar manner. Another related application is to obtain the 
coefficients of a local expansion around another site, as described next. 

Expansion Around a Site 

In order to do perform the augmentation in practice, one of the steps which is 
needed is to expand a smooth Hankel function around some point in the unit 
cell. Far away from the center, the smooth Hankel function equals the unsmooth 
variant and the well-known structure constant expansion for the standard Hankel 
functions could be used. On the central sphere, the function is given explicitly by 
its definition. It is for sites close to the central sphere, such as nearest-neighbor 
atoms, where something new is needed. Here the function generally starts to 
bend over and the standard expansion does not apply. 

This problem is solved as follows. We define a family of higher-order gaus- 
sians Gfei(r) by applying differential operators to the seed function go{r) = 
C'exp(-rVi?2^); 

Gfei(r) = AMd-MMr) ■ (9) 

We can construct biorthogonal polynomials to these functions, ie., a set of 
polynomials PkL{i') with the property 



GkL{r)Pk'L'{r) dr = Skk'^LL' ■ 



(10) 
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In fact, it turns out that PkL is just GkL divided by go[r) times a normaliza- 
tion constant. To expand an arbitrary function /(r) as a sum of the PkL, each 
coefficient can be calculated by the integral over /(r) times the corresponding 
gaussian: 



f{r) =^AkLPkL{r) ( 11 ) 

kL 

where 

Ak^ = J f{r)GkL{r)dr . ( 12 ) 

This expansion, when truncated to some low value of k, is considerably more ac- 
curate than, for example, a Taylor series. This is because the expansion converges 
smoothly towards /(r) in the range where ,9o(^) is large as more terms are in- 
cluded. When /(r) is a smooth Hankel function centered anywhere in space, the 
integrals defining expansion coefficients can be done analytically. This supplies 
the desired local expansion. 

The expansion is used in several different steps, most prominently to augment 
the envelope functions. Xote that in this procedure, there are two distinct para- 
meters which influence the accuracy of the expansion. By choosing a cutoff Pmax 
for the terms in the expansion, the radial function is represented as a polynomial 
of order Pmax- The range over which the expansion is accurate is determined by 
the smoothing radius i?sm of the projection gaussians Gll- When i?sm is chosen 
larger, the expansion can be used over a larger part of space but will not be 
as accurate overall for the same value of Pmax- Choosing /?sm in the vicinity of 
one-third of the muffin-tin radius will usually give a reasonable expansion within 
the muffin-tin sphere. 

2.3 Augmentation 

In the following, we describe the modihed augmentation procedure used in the 
method. In general terms, the pseudopotential formulation and augmentation 
are two competing approaches to introduce atomic detail into the wavefunction 
near the nuclei. When a pseudopotential formulation is used, this is implicit: 
although only smooth functions are manipulated during a calculation, the true 
wavefunctions could be derived from these in a well-defined manner. When aug- 
mentation is used, the basis functions are explicitly constructed to show the 
required strongly-varying and oscillatory character close to an atom. The first 
step is to cut space into atomic spheres and an interstitial region. Throughout 
the interstitial region, the basis functions are equal to suitable smooth “envelope 
functions” which in the present case are the smooth Hankel functions introduced 
above. Inside each atomic sphere, every envelope function is replaced by a nume- 
rical solution of the Schrodinger equation. Specifically, in the well-known linear 
methods [1], numerical solutions of the Schrodinger equation in the spheridized 
potential and their energy derivatives are combined to match smoothly to the 
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envelope function at the sphere boundary. Overall, this procednre amounts to a 
piecewise definition of the basis function in different parts of space. 

Comparing the two approaches, the norm-conserving pseudopotential formu- 
lation [15] has a number of advantages, once the initial effort of constructing 
the pseudopotential has been completed. Angular momentum cutoffs arc gene- 
rally low and an expression for the force is easy to obtain. In contrast, since 
augmentation works by cutting out the part of space near the nucleus and trea- 
ting it separately, quantities must sometimes be expanded to relatively high 
angular momenta. Due to the complexity of the augmentation procedure, it is 
often difficult to derive a valid force theorem. In practice, the augmentation and 
pseudopotential approaches have some similarity. Both methods expand a set of 
smooth basis functions by angular momentum around the different sites, then 
operate on the different angular momentum components independently. This 
suggests that a more unified description should be possible. 

With the aim of a practical formulation for all-electron methods which has 
the simplicity and transferability of the pseudopotential approach, the formu- 
lation of augmentation described in the following shares some aspects with the 
projector augmented-wave (PAW) method [5] and has some similarity with Van- 
derbilt’s ultrasmooth pseudopotentials [6]. As in the PAW method, “additive 
augmentation” is used to reduce angular-momentum cutoffs for the representa- 
tion of the wavefunctions, charge density, and potential. However, the present 
approach eliminates the need to construct pseudopartial waves and projector 
functions. Also, completeness of the partial-wave expansion is not an issue, the 
force theorem here seems to bo simpler, and orthogonality to the core states is 
automatic. 



Augmented Basis Functions 

As will be described in more detail below, the crystal potential will be written 
in the following form: 



V(r) = Vo(r) + ^{Fm(r)-I/2.(r)} (13) 

1/ 



Here Vq is a smooth potential extending through the unit cell, tabulated on a 
real-space mesh; is the true potential inside the atomic sphere v, given as 
an expansion in spherical harmonics times numerical radial functions, and V 21 , is 
the smooth mesh potential expanded in the same way. The tilde over a potential 
term indicates that compensating charges modeled by gaussians entered into the 
electrostatic contribution. One sees that the potential is expressed as a smooth 
function Vq which is augmented by adding a local term Vu, — ¥ 2 ,^ inside each 
sphere. The relevant point is that the local expansions of Vu, and V 21 , can be 
truncated at the (same) low angular-momentum cutoff with small loss of 
accuracy. To obtain a smooth overall potential, Vi^, and V 21 / should have the 
same values and slopes at the sphere surface. The higher ^-components of the 
two functions are closely similar and can be left away in the difference — ¥ 21 ^. 
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This does not mean that the higher angular momentum components are set to 
zero in the total potential; instead, they are carried by the smooth mesh potential 

After identifying the form in which the potential is specified, the next step 
is to calculate the Hamiltonian and overlap matrix elements for the basis fun- 
ctions Xi(r), obtained by augmenting the (as yet unspecified) smooth envelope 
functions Fi{r). The required integrals over the unit cell are 



Hyj = j x*i{r)[-A + V{r)\xj{r)dr 


(14) 


S^J = J xl{r)Xj{r)dr . 


(15) 



At this stage, the envelopes could be the smooth Hankel functions, plane waves, 
or some other set of smooth functions extending through the unit cell. 

In order to perform the augmentation, the first step is to project out local 
information about the envelopes near a chosen site. This can be done using some 
local set of radial functions, denoted by PkL, used to expand the i-th envelope 
as 

F,ir)=Y,CitPkL{r) . (16) 

kL 

The notation anticipates that we will later use the polynomials of Section 2.2, 
but in this context the PkL are general functions with well-defined angular mo- 
mentum and sufficient radial degrees of freedom. 

To augment, we first construct functions PkL which are augmented versions 
of the separate PkL- Assuming PkL is given in the form 

PkL{r) = Pki{r)YL{f) (17) 

then the augmented version is defined as 

PkL[r) = Pki{r)YL{f) = ^Akicjjiir) -|- Hfc;(^/(r)j 1^(7) (18) 

where Pkii^) equals the contents of the square brackets. In the standard way, 
and <pi{r) arc a specific solution of the radial Schrodingcr equation and its 
energy derivative, respectively. The coefficients Aki and Bki are chosen so that 
Pki and pki have the same value and derivative at the muffin-tin radius Ry^t ■ The 
augmented envelope function then is 

Fi{r) = Fy{r) + ^C«{PfcL(r) - Pfe^r)} . (19) 

kL 

At hrst sight, this expression seems slightly nonsensical. Using (16), the first term 
should cancel against the last, leaving only a sum over the PkL- The relevant 
feature is that, when the sums over k and L are truncated (as will be assumed 
from here on) , the result is still close to the complete sum. Exactly as in the case 
of the potential, we start with a smooth function containing all components up 
to infinity and replace only a few of the lower terms by numerical functions. 
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Overlap and Kinetic Energy Matrix Elements 

The next task is to calculate overlap integrals for the augmented envelope fun- 
ctions. These integrals are expressed as follows: 

F*F, dr= [ F*F, dr+Y, (20) 

kk'L 



where 



(^kk'l = J ^PkLPk'L - PkhPk'L'^ dr ( 21 ) 

is an integral over the atomic sphere. For simplicity in the notation, it was 
assumed that there is one atom in the unit cell and that PkL is real. Note that 
o'kk’i depends only on /, not on L=(l,m). In (20), the integral is evaluated by 
first calculating the overlap between the smooth envelopes as an integral over the 
whole unit cell. Then local information about the envelopes is projected out in 
the form of the coefficient vectors and . Finally, the product OtcrClO') 
is added, where <7 is a small symmetric matrix characterizing the atom at this 
site. 

It should be pointed out that equation (20) is not formally equal to the 
integral over F* Fj when these functions are given by (19). Instead, the straight- 
forward integral over the product would lead to a large number of unwieldy 
cross terms. The point is that (20) gives the correct result when all sums are 
taken to infinity. For finite cutoffs, the result is a good approximation to that 
which would be obtained if no truncation were done. As before, this follows be- 
cause the discarded terms in the integrals over PkkPk'L and PkhPk'L are similar. 
Therefore the truncation errors cancel to a large extent when the the difference 
between the two contributions to a is taken. Thus, the question of leaving away 
the cross terms in the expression for this and similar matrix elements is purely 
a convergence issue: we have an expression which is correct when all terms to 
infinity are included, bTit which must be truncated to a suitable finite sum for a 
practical calculation. Consequently it is of high priority to arrange matters for 
rapid convergence. 

As a second comment, let us assume for a moment that the radial augmen- 
tation functions 0/ and <j)i arc kept frozen throughout the calculation. This is 
usually a reasonable procedure in practice, although it was not done in the cal- 
culations presented further on. Then the augmented expansion functions PkL are 
also invariant and consequently the local matrix Okk'i is completely independent 
of the environment. The formulation thus begins to approach that of a unique 
and transferable pseudopotential. 

Analogous to the overlap integral, the kinetic energy integrals are given as 
follows: 



F:[-A]Fj dr= F*[-A]Fj dr + Y ctl^rkk'iC'i^L 

^ kk'L 



( 22 ) 
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where 

Tkk'l = J ^PkL[-^]Pk'L - -PfcL[-^]-Pfc'i| dr . (23) 

Although it is not immediately apparent, the local kinetic energy matrix Tkk'i 
is also symmetric. When the operator —A is moved from the second to the first 
function under each integral, two surface terms over the sphere boundary arise. 
These cancel because PkL and PkL match in value and slope. As in the case of 
the overlap, Tkk'i is independent of the environment if the radial augmentation 
functions (pi and (pi are kept frozen during the calculation. 

Potential Matrix Elements 

The potential matrix element is somewhat more complicated. The electrosta- 
tic potential inside a given sphere depends not only on the density inside this 
sphere, but also on the density in all other parts of the unit cell. This is one 
reason why it is not easy to separate out a transferable local potential from 
the overall eigenvalue problem, even though this is a fundamental feature of the 
pseudopotential approach. 

In precise terms, the potential is an auxiliary function needed to minimize 
the total energy respective to the trial density. Taken times some density va- 
riation and integrated, the potential should give the first-order response of the 
electrostatic and exchange-correlation energy for the given variation of the trial 
density. As described below, when evaluating the electrostatic energy, compensa- 
ting gaussians are added to the smooth mesh density to make a “pseudodensity” 
with the correct multipole moments in the spheres. This construction enters into 
the total energy and, as a consequence, gives rise to certain terms in the poten- 
tial matrix elements. To formulate this properly, write the total crystal density 
in a way similar to the potential (13) as 



n(r) = no(r) + |pi(r) - P2(^’)| 



where Uq is a smooth function on the real-space mesh and pi, p 2 are true and 
smooth local terms defined only inside an atomic sphere. (For the case of several 
atoms per unit cell, the term in braces is replaced by a sum over the spheres.) 
If the local sphere density pi — p 2 has multipole moments qm, the compensated 
mesh density is 

no{r) = no{r) + '^qMGM{r) (25) 



where Gm is a gaussian of moment unity with angular momentum M, localized 
inside the muffin-tin sphere except for a negligible tail. 

Similar to the other matrix elements, the potential energy integral can be 
written in the form 

f F:VFjdr= f F*VoF,dr+ ^kl*^kk'LL'Ci% . (26) 

'J Ul.l T T / 
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Again, the integral is first evaluated for the smooth mesh functions, after which 
the projection coefficients are combined with a small local matrix to add on the 
contribution of an atomic site. Here, the local potential matrix iTkk'LL' is dia- 
gonal in L only if the sphere potential terms Vi and Vi are taken as spherical. 
To derive an expression for TTkk'LL' wc inspect the changes in the electrostatic 
and exchange-correlation energies due to a variation of the charge density. Pre- 
senting only the results, the final expression involves the multipole moments of 
PkLPk'L' — PkhPk'L' ■ 

Qkk’LL’M = j ^PkbPk'L' ~ PkLPk'L'^ dr . (27) 

The matrix TTkk'LL' turns out to be the sum of two contributions. The first term 
involves the smooth potential Vq for the compensated mesh density: 



mesh 

^kk'LL' 



Qkk'LL'M 

M 



VoGMdr . 



(28) 



The second term involves the true and smooth local potentials, Vi and Vi : 



local 

^kk'LL' 



PkhViPk'L' - PkLV 2 Pk'L^ j dr 
V2GMdr . 



— ^2 Qkk'LL'M 
M 



(29) 



Of course, 28 will cancel to a large extent against the last term in 29 since Vi 
should be a local representation of the mesh potential Vq. It is necessary to 
include all the terms as described in order to minimize the total energy exactly. 

These expressions have moved the situation quite a bit towards the desired 
environment-independence of all local atomic terms. In (29), the true potential 
Vi is felt by the true partial density PkLPk'L' while the smooth potential Vi is 
felt by smooth partial density PkLPk'L' + '^Qkk'LL'mGM- These two partial 
densities have the same multipole moments; this was essentially the dehnition of 
the quantity Qkk'LL'M- If we modify the boundary conditions for the electrosta- 
tic potential inside the sphere in arbitrary way, this just adds the same linear 
combination of the harmonic functions r’^YM{r) to both Vi and Vi (see below). 
It follows that the result of (29) does not depend on the electrostatic boundary 
conditions on the sphere, which can therefore be set to zero. The result is that 
(29) defines a quantity which can be calculated completely and unambiguously 
with only the density inside the sphere as input. However, this does not yet make 
T^'kkML' ^ fixed quantity when the PkL are kept frozen (as was the case for the 
overlap and kinetic energy matrix elements) . The reason is that the local sphere 
density pi — p 2 can change in the course of a calculation, which will lead to chan- 
ges in Vi and ¥ 2 - Keeping the local sphere density frozen is thus an additional 
approximation, albeit reasonably plausible, which must be assumed in order to 
get full independence of from the environment. While this was not done in 
the current method, further investigations in this direction would be interesting. 
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Finally, we discuss the significance of the term given by (28). Aug- 

mentation modifies the charge density inside a muffin-tin sphere, including the 
total sphere charge as well as the higher multipole moments. In this sense our 
procedure is much more similar to Vanderbilt’s ultrasmooth pseudopotential 
approach [6] than to the usual norm-conserving formulation. In the smooth re- 
presentation of the problem, the sphere charge and the higher moments are 
mapped onto gaussians of the correct normalization. The term in (28) describes 
the interaction of these gaussians with the smooth mesh potential. 

Construction of the Output Density 

To complete the self-consistency loop, the output density must be constructed: 

n°’^‘(r) = ^u;„|V'„(r)|2 (30) 

n 

where the sum runs over the occupied eigenstates and the are occupation 
numbers including spin degeneracy. Each wavefunction is a linear combination 
of the basis functions in the form 

i’n{r) = ^TinFin{r) (31) 

i 

and the output density is a sum over the products F* Fj : 

n-‘(r) = ^{^u;„7);r,„}F’*(r)T)(r) . (32) 

ij n 

The output density should be expressed in the same form as the input density 
in (24). That is, it should be given by a smooth mesh density n™* together with 
separate true and smooth local contributions and for each sphere v.. 
In view of the preceding discussion of the augmentation process, the product of 
two augmented basis functions should be calculated as 

F:F,=F*F^F ^ C^l*[PuLpk'U-PkLPk-L'}c% (33) 

kk'LL' 

where we have again assumed one atom per unit cell for notational simplicity. 
The integral over this quantity gave the overlap matrix, and the integral over 
this quantity times the potential gave the potential matrix element. In the latter 
case, the first term interacted with the smooth mesh potential Vqj the second 
with the local true potential Vi, and the third with the local smooth potential 
V 2 in the sphere. Correspondingly, the accumulated sums over the first, second, 
and third terms produce n™*, p°'p, and p^'P, respectively. 

2.4 Representation of the Density and Potential 

In the method presented here, some effort was taken to formulate matters such 
that low angular-momentum cutoffs can be used. As already mentioned above. 
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quantities defined throughout the crystal are represented as a smooth function 
which extend through the whole unit cell, plus contributions which are non- 
zero inside the mufhn-tin spheres. This “additive augmentation” formulation has 
advantages over a piecewise definition, which would replace the smooth function 
by another quantity inside a sphere. It is the key to good accuracy at low angular- 
momentum cutoffs because it permits the smooth interstitial function to supply 
the higher angular momentum components. 

To represent the valence density, we start with the smooth function no(r) 
tabulated on the real-space mesh. For each atom v we carry about a “true” 
density, which is to be added on, and a “smooth” contribution, which is to be 
subtracted and should equal the local expansion of no(r) except for the angular- 
momentum cutoff. The potential is written in a similar way. Collecting together 
the corresponding expressions from above, we have 

n(r) = no(r) + J2{pii^(r) - P2^(r)j (34) 

V 

and 

V(r) = Co(r) + ■ (35) 

Here includes the core and the nucleus, and piv includes their smooth 
“pseudo” versions, modeled by localized gaussians. Each local contribution = 
P\v — P 2 u is a sum over various angular momentum components, is non-zero only 
inside the corresponding MT sphere, and goes to zero smoothly as it approaches 
the MT radius As already emphasised, even if pi, is truncated to a low 

angular momentum (possibly only to its spherical part) the full density n(r) 
includes contributions for all L up to infinity. At high angular momentum, the 
radial part for any smooth function approaches a constant times rK It is point- 
less to include these components explicitly in the local representation, since they 
are already contained in no(r). 

A special situation arises if the core states are extended enough to spill out 
of the muffin-tin sphere. In such a case, the smooth core density is written as the 
sum of a gaussian and a single smooth Hankel function of angular momentum 
zero, whereby the latter term describes the spilled-out tail core density. In the 
“frozen overlapped core approximation” (FOCA) [16] the smooth crystal core 
density is obtained by overlapping these atomic contributions. This means that 
a further parameter enters, namely the smoothing radius which defines this core 
Hankel function. Whereas a large smoothing radius makes it possible to use a 
coarser real-space mesh to represent the overlapped core density, generally the 
correct core density outside the sphere can be modeled more accurately using a 
smaller value of the smoothing radius. 

The potential is made by solving the Poisson equation for the input density 
and adding the exchange-correlation potential. This is described next. 
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Electrostatic Energy and Potential 

At the start of each iteration, the crystal density is available in the form given in 
(34) above. To make the electrostatic potential, hrst a smooth “pseudodensity” 
is constructed which equals the given density in the interstitial region and has 
the correct multipole moments in all spheres. This is done by adding localized 
gaussians to the smooth mesh density with the same multipole moments as 
the local contributions pi^ — p 2 u within the spheres: 

no = no + '^gy =no + '^qyMG,,M (36) 

V I/M 

The electrostatic potential of the smooth mesh density no(r) is made by trans- 
forming to reciprocal space by fast Fourier transform, dividing the coefficients 
by the squares of the reciprocal vectors, and transforming back. The electrosta- 
tic potential due to the gaussian terms includes the contribution of the nuclei 
and is handled analytically in order to avoid the need for a higher plane wave 
cutoff. Together, this produces an electrostatic potential V()®‘^(t’) which is valid 
throughout the interstitial region and extends smoothly through the spheres. 

Next, the electrostatic potential inside the spheres is determined. To recover 
the true density from the smooth pseudodensity, the following quantity must be 
added at each site ly. 



Piu — {P2v + 9u) — Piv — p2u ■ (37) 

By construction, this local contribution has multipole moments which are zero. 
We solve the Poisson equation twice to obtain the “true” local potential V)®® and 
the “smooth” local potential 

AVZ = -87TP1, (38) 

^^ 2 t = -87r(p2<. + gu) ■ (39) 

The source terms ph, and p 2 u = P 21 / + 9i/ have the same multipole moments, and 
we are really only interested in the difference E®® — I 2 ®®. Thus, the boundary 
conditions when solving the Poisson equation are here irrelevant, as long as the 
same set is used in both equations, and can be set to zero. 

Finally, the electrostatic energy is obtained by adding together a smooth 
mesh term and local terms for all sites: 

I «o(r-)Eo®®(r) dr + [ Pii/^Z dr - f p 2 i/V^^ drj . (40) 

As a general note on the energy integrals, expression (40) for the electrostatic 
energy involves an important convergence issue similar to the one discussed for 
the augmentation procedure. If Equation (34) for the density is taken seriously, 
it should be possible to add together piy and —p 2 i/ for each site from the very 
beginning. However, in our approach the program explicitly carries about both 
functions in separate arrays, making it possible to evaluate the electrostatic 
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energy as described. In fact, the snm in (40) is not strictly the exact electrostatic 
energy for the density in (34) for any finite angular-momentum cutoff; again, a 
straightforward evaluation would lead to a large number of problematic cross 
terms. 

In the description used here, there is a “true” and a “smooth” part to the 
problem which are kept separate. When evaluating energy integrals like the one 
above, the true density is not allowed to interact with the smooth potential and 
vice versa. The choice between the two possible expressions for an energy integral 
is only relevant for the rate of convergence with the angular-momentum cutoff, 
and not for the final outcome at convergence. When all angular-momentum 
components up to infinity are included, the distinction between the different 
ways to calculate the integral disappears. In the end, the main consequences of 
the formulation as in (40) are: 

• In the resulting energy integrals, the higher angular momenta are supplied 
by the smooth density no(r). That is, we start with an expression which 
contains all angular momentum components up to infinity in each sphere, 
then replace a few of the lower momentum terms. 

• The aforementioned unwieldy cross terms do not have to be evaluated. 

• The augmentation procedure described above cleanly separates into a smooth 
part plus local contributions, whereby each local term is independent of the 
environment of the atom. 

Exchange-Correlation Energy and Potential 

In the same spirit, the exchange-correlation energy is calculated by integrating 
a smooth function over the whole unit cell, then replacing the smooth contribu- 
tion by the true one inside each MT sphere. Things are simpler here than for 
the electrostatic energy because no additional terms are needed to correct the 
multipole moments. Thus, the smooth exchange-correlation potential is made 
by evaluating it point-by-point for the smooth mesh density no{r). Adding this 
to the electrostatic potential for the compensated density gives the total mesh 
potential 

Vo{r) = + /Txc(no(r)) (41) 

Inside each sphere, two different exchange-correlation potentials are made 
and added respectively to the true and smooth local electrostatic potentials: 

Vm(r) = P®"(r) + (42) 

V2:.{r) = V^^ir) + p^ciP2Ar)) (43) 

to produce the final potentials seen by the true and smooth densities at this site. 
The exchange-correlation energy is made by constructing the energy density e^c 
in exactly the same way and then adding together the smooth mesh term 



no£xc(no) dr 



(44) 
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and the sum over the local contributions 

/ /5ii.exc(/fi<.) dr - / P2uf^xc{p2u)dr . (45) 

Js^ Js^ 

Since £xc(p) is a nonlinear function of the density, it mixes together all angular 
momentum components up to infinity. That is, £xc(pl + Pk) is not the same as 
£xc(/3l) + £xc(Pif) and we should really include all L terms, even when making 
something as fundamental as the spherical part of £xc(p)- By evaluating the 
exchange-correlation energy as described, the interaction with the higher angular 
momenta is taken over by the smooth mesh contribTition. When ph^ and p 2 i/ are 
truncated to the same low angular momentum, the errors in the two integrands 
Piiy^xcipiu) and P 2 (/£xc(P 2 (/) are similar. Thus, while the two integrals in (45) 
converge rather sedately with the L cutoff separately, their difference converges 
rapidly. 

Force Theorem 

The force on an atom is defined as the negative gradient of the total energy 
respective to the corresponding atomic coordinates: 

= -V f_iEsc(Ri, ■ ■ ■ , Rfi, ■ ■ ■ , Rn) ■ (46) 

Here Esc is the full self-consistent total energy, which depends only on the 
atomic positions. In terms of small differences, the system is made self-consistent 
at two slightly different geometries and the total energies are compared. A force 
theorem is a closed expression, making it possible to calculate the forces 
without explicitly shifting the atoms. Since the self-consistency process mixes 
together the various energy terms, a straightforward differentiation of the total 
energy expression is a strenuous (and not always feasible) way to obtain such a 
theorem. 

A more convenient way to derive a force theorem is as follows [1 1] . Assume we 
want to calculate the forces at the geometry = {R^i \ ■ ■ ■ ,i?^^), for which 
the self-consistent density is known. For this purpose we make an arbitrary guess 
for the way in which the density responds as the atoms are moved. That is, we 
define a density hp{r) for each geometry P = (i?i, . . . , Rn) which conserves the 
total charge and which approaches the correct self-consistent density smoothly 
as P — >■ From the variational properties of the energy functional it follows 

that 

V^Fsc(-Ri, • • ■ , Rn) = V^F(Pi, . . . , Rn) (47) 

where E is defined as the Harris energy [17] evaluated for the guessed density: 

E{Ri, . . . , Rn) = En[np{r)] . (48) 

It follows that the forces can also be obtained by differentiating the auxiliary 
function E. This is a considerably easier task than differentiation of the self- 
consistent energy because the Harris energy can be written down explicitly as 
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function of the density. Furthermore, since the guessed function np{r) can be 
chosen freely, different force theorems can be obtained depending on this choice. 
Sensibly, fip[r) should be defined in a way which makes it easy to perform the 
differentiation of E. 

In the present formalism, the charge density is defined by a smooth mesh 
density no{r) together with true and smooth local terms pu^{r) and P 2 v{i') 
associated with each site v. If these functions represent the self-consistent density 
at geometry a natural choice for the guessed density hp(r) at a different 

geometry P is that pu, and p 2 i, are carried along rigidly with the moving atoms 
while no is unchanged. 

This choice leads to a simple force theorem, as will be shown next. When the 
core states are treated separately, the Harris energy takes this form: 

^ - j + C/ + + Teore (49) 

where Tcove equals ~ f and all eigenvalues are calculated in 

the effective potential Heff: made from the input density n™* + n'^°''®. Integrals 
(including the implicit ones in {7 and i?xc) are all assembled in a similar way out 
of contributions from the smooth mesh density and from the spheres. 

The aim is to derive the first-order change SEh for the density change defined 
above. It reasonably straightforward that the last two terms give no contribution. 
Furthermore, all integrals over the spheres do not contribute. This is because the 
total-energy terms and the augmentation matrices for each atomic sphere are in- 
dependent of the environment, depending only on the (here invariant) densities 
and p 2 v On the other hand, a number of terms arise because the electrosta- 
tic potential on the mesh changes by some amount 6Vq as the compensating 
gaussians move along with the atoms. Each compensating gaussian can be 
split into a valence part 3 ™* and a smooth representation of the core and nucleus 
p™. The compensated mesh density then equals 

no = no + ^g,,= [no + ^ -t ^ 5 ™ (50) 

where the term in brackets is the smooth representation of the valence density. 

The contributions to (5 Eh from the first three terms of (49) are obtained as 
follows: 

• By first-order perturbation theory, each eigenvalue e™* changes by 

JC‘ =< Cr,\3H - > (51) 

where H and S are the Hamiltonian and overlap matrices and is the 
column eigenvector. By summing over the occupied states and inspecting 
how Vq enters into the Hamiltonian, the outcome is 



(52) 
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Here <5^ refers to the eigenvalue change when the potential Hq and the aug- 
mentation matrices entering into H and S are kept frozen. 

• The change in the second term is 

-5 I n^Veff = - J[no + J2 9T'] <^^0 “ / <^5^' ■ (53) 



• The change in the third term is 

6U = J + 5g:^] . (54) 

By summing these three contributions, the final force theorem is obtained: 

6Eu = Jvo6g:’^ + 6^J2^n' ■ (55) 

Here the first term describes the force of the smooth density on the gaussian 
lumps which represent the core and the nucleus at each site. The second term 
is a generalized Pulay [18] term. It describes the eigenvalue shifts for a chan- 
ging geometry but invariant smooth mesh potential and augmentation matrices. 
However, these augmentation matrices are used at the shifted atomic positions. 
This term can be evaluated in a straightforward manner since it mainly involves 
the gradients of the various quantities (such as the expansion coefficients of the 
envelope functions) which are needed to assemble the Hamiltonian and overlap 
matrices. 

3 Tests of the Method 

We have performed a number of tests to check the validity of the method. This 
we do in two ways. First, we investigate the sensitivity of the total energy to 
the various parameters at our disposal, i.e. the charge density and augmentation 
i-cutoffs Ip and la, the augmentation polynomial cutoff ka, the smoothed density 
spacing, the treatment of the core, and the smoothing radii for the charge density, 
augmentation, and core, which we label respectively R-sm? Rgai- Second, 
we compare our results to other local-density calculations for several materials 
systems. Lastly, we present rules of thumb for the choice of basis. 

3.1 Dependence on I— and fc-Cutoffs 

Because the augmentation here shares a lot in common with a pseudopotential 
formulation, we expect that the Lcutoffs in the augmentation la and explicit 
representation of the charge density Ip to converge as efficiently. Indeed we find 
(see Fig. 3) that for la=2 the total energy is converged to ~10^^ Ry/atom, and 
to Ry/atom for ?a=3, even for the transition metal Ti. For a hxed la, 

a somewhat faster convergence in Ip was found: even lp=Q was quite adequate 
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Table 1. Parameters for baseline reference in the tests for GaN, Ti, and Se. i?mt, 
Rbiu and are in units of the average Wigner-Seitz radius. As fractions of touching 
sphere radii, the Rmt are 0.99, 0.97, and 0.96, respectively 





Rmt 


rye 

-^sm 


R” 

‘'sm 


ka 


la 


Ip 


mesh 


basis 


GaN 


0.69 


0.22 


0.17 


4 


4 


4 


20 


spd X 3 


(small basis) 
















spd X 2 


Ti 


0.88 


0.28 


0.19 


4 


4 


4 


24 


spd X 2-f-p 


(small basis) 
















spd X 2 


(minimal basis) 
















spd 


Se 


0.60 


0.20 


0.15 


4 


4 


4 


24 


spd X 2+sp 





Fig. 3. Errors in the total energy, in Ry/atom, as a function of the i-cutoffs Ip for fixed 
G=5 (top panel) and G, with lp=la (bottom panel). Solid line: GaN, dashed line: Ti; 
dotted line: Se. 
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for GaN and Ti, but this may be an artifact of the relatively high symmetry of 
those lattices. 

The polynomial cutoff ka in the P^i expansion is another parameter whose 
convergence was checked. Because the tails have a rather weak energy depen- 
dence, we would expect that already after two terms (fca=l) the total energy 
should be reasonable. Fig. 4 shows that the energy is converged to ~10^^Ry/atom 
for fca=l (though the Ti calculation produced nonsensical results in that case), 
and ~10^'^Ry/atom for ka=2. It is evident that the convergence of ka will be 
sensitive to the augmentation smoothing i.e. the smoothing radius for which 
one projects tails of the envelope functions into polynomials inside the augmen- 
tation site. The larger one makes the more broadly dispersed the errors 

in the polynomial expansion; the optimal choice of distributes the errors 
most evenly throughout the augmentation sphere. From experience we have fo- 
und that Rgjf, ~ J2int/3 is approximately the smoothing for which the most rapid 
convergence with ka is attained. 




Fig. 4. Errors in the total energy, in Ry/atom, as a function of the fc-cutoff. Solid line: 
GaN, dashed line: Ti; dotted line: Se. 



3.2 Dependence on MT and Smoothing Radii 

One critical test of the theory’s validity is its dependence of the total energy on 
muffin-tin radius Rmt, because by changing R^t one changes the representation 
of the basis. If the basis is complete, and all the cutoffs {R, Ip, ka and mesh 
spacing) and smoothing radii (Rgm, Rgm) fine enough, there should be no 

dependence on Rmt- Indeed, we find this to be essentially true for the cases we 
studied (ZB GaN, Se, and Ti). Parameters for which the sensitivity was checked 
in the greatest detail were the core smoothing radius Rgm, and truncation of the 
basis. Using the reference parameters in Table 1, the total energy was found to 
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be constant for all three tests to within ~1 mRy/atom when the MT radius was 
varied from '^0.85x Rx, to where Rt is the touching-sphere radius. 

Fig. 5 illustrates the dependence for GaN and Ti, for both the baseline case and 
when one parameter was changed relative to it. 

That the energy is independent of R^t for i?nit < Rt is a demonstration 
that the errors can be well controlled, and the energy converged to a high pre- 
cision as with the LAPW technique. That the energy is is independent of i?mt 
for i?nit > Rt is less obvious, since spheres overlap and wave functions have 
different, and inconsistent representations. Previous implementations of full- 
potential, augmented-wave programs do not show this independence, and we 
attribute this to the fact that both the augmentation and the local contributi- 
ons to the potential go smoothly and differentiably to zero at the MT boundary; 
thus the overlapping regions are “doubly counted” with a very small weight. 

Fig. 5 shows that the energy is rather sensitive to the core smoothing radius. 
This is not surprising, since the core energy is very large and small changes in 
the core density can produce significant effects. But, energy differences between 
different structures should be significantly smaller than this. Also, as expected, 
there is a much stronger dependence on the total energy when the basis is made 
small. (This error can be reduced by re-optimizing the wave function parameters 
for each new choice of MT radius. No attempt was made to do this.) As one 
shrinks i?mt) the interstitial volume increases and throws greater weight into 
the interstitial representation. The figure also shows that R^^ must be small 
enough to really properly conhne the projection of the MT potential into the 
augmentation sphere, or about 0.25xi?mt- .^sm can bo set sot to about 0.4xi?mt> 
negligible loss in accuracy, and up to about O.bxi^mt, with a loss in absolute 
accuracy of ~1 mRy/atom, provided i?mt is kept close to touching. This is quite 
adequate for most applications, since such error will cancel when comparing 
relative energy differences. Some information about the MT dependence in Se is 
discussed in the comparison to other LDA results. 

3.3 Dependence of the Total Energy on Basis 

Having the extra degrees of freedom in the basis is one key advantage the present 
scheme has over conventional LMTO approaches. The two degrees of freedom, 
the energy e and smoothing radius Rg-m can be adjusted for each orbital. As we 
show here, a minimal basis of spd orbitals produces ground state energies within 
a few mRy of the completely converged LDA energy in close-packed systems; 
similar convergence can be achieved with an spdsp basis in open structures. 

As of yet, there is no automatic prescription for finding the optimum choice 
of these parameters suitable to the solid, and we offer some general rules of 
thumb based on practical experience. One obvious choice is to determine Rgm 
and e from the free-atomic wave functions. This turns to be a reasonable choice, 
particularly for the compact orbitals such as transition metal d orbitals or deep 
states such as the Ga d semicore levels. 
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Fig. 5. Dependence of the total energy, in Ry/atom on muffin-tin radius. Reference 
parameters are provided in Table 1. Top: Ti; bottom: ZB GaN. Arrows mark MT radius 
for touching spheres. The large (short dashed line) correspond to Rsm^O.GxRT’, 
the reference Rsm (solid lines) correspond to ~0.32xRt. The large R§^ (long dashed 
line) correspond to Rsm~0.3xRT; the reference R§m correspond to R§^ ~0.25xi?T. 
Calculations repeated using the perturbation approach to the core-valence interaction 
are indistinguishable from the ones shown. 



rs(N) 
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rs(Al) 



Fig. 6. Contours of the deviation of the total energy from the optimum value, in 
mRy/cell, as functions of Asm for A1 and N. All the orbitals on each atom were given 
the same Asm- There are four minima, at (Asm(Al),Asm(N))^ (0.95,1), (0.85,3), (1.9,1), 
and (1.6,3). All the minima are within 4 mRy of each other. 



Interestingly, the extended sp orbitals usually have two very different opti- 
inuin values of Rsm- This is illustrated for AIN in Fig. 6. Sonic rules of thumb, 
gathered from experience for a number of cases, are: 

• For localized, narrow-band orbitals, the values of Asm and e fit to the free 
atom are close to optimum. 

• Reasonable choices of e are — 1 < e < 0. For wide-band orbitals, the best 
choice of e is near zero. 

• For sp orbitals, the “small” optimum Asm is usually w2/3 Amt- The “large” 
optimum Asm is ssl.5 Amt and mostly resembles a gaussian orbital in the 
near Amt- 

• For close-packed structures, a minimal basis consisting of the nine spd orbi- 
tals (16 for rare earths) is sufficient to produce a total energy with ~5mRy/atom 
or so of the totally converged LDA result. The error is slightly larger for hea- 
vier elements, unless / orbitals are included. 

• For open structures, a basis consisting of thirteen spdsp orbitals usually 
produces a result with a similar accuracy. In that case, choosing one set of 
sp orbitals with a “small” Asm, and the the other with a large Asm seems 
to work well. There is an occasional exception to this rule; for example, the 
deep Ga d states in GaN require two sets of d orbitals to reach this accuracy. 
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• The fastest convergence in ka was achieved for is ~l/3xi?„it; using 

that value, ka=‘2 or 3 was adequate for all systems we studied. 

3.4 Comparison with Other Density-Functional Calculations 

First, we investigate the cases of elemental Se and Te. These form an open, 
low-symmetry crystal with approximately 90-degree bond angles. The electronic 
structure is approximately described by pure atomic p orbitals linked together in 
one-dimensional chains, with a strong a bond connecting neighbors in the chain 
and with a weak interaction between the chains. The weak inter-chain interaction 
combined <j-like character of the bond make the stiffness of the crystal weak with 
respect to angular distortions, and thus pose a rather stringent test for the local- 
density approximation, and the low symmetry and open structure make a a good 
test for an atom-centered method’s method’s ability to reproduce the converged 
local-density result. The crystal structure of Se and Te is hexagonal with three 
atoms per unit cell, and may be specified by the a and c parameters of the 
hexagonal cell, and one internal displacement parameter u. Using a plane-wave 
pseudopotential approach, Dal Corso and Rcsta[19] have calculated the bonding 
in some detail, and they have also calculated the equilibrium structure of Se for 
both the LDA and adding gradient corrections. 

Table 2 shows our LDA structures agrees well with that of Corso and Resta. 
For both calculations, the LDA predicts reasonably well the strong intra-chain 
bond length, but rather poorly the inter-chain bond length. The table also shows 
the dependence of the structural parameters and total energy on the size of 
basis and the MT radius. The reference basis consisted of one spd group with 
/?sm=l-3, e=-0.1 Ry and an additional sp group with Rsm=l-4, e=-l Ry, for 13 
orbitals. Adding a second d and a third s orbital lowers the energy by 2/3 iiiRy 
per atom; adding an / orbital lowers the energy by an additional 2.5 mRy per 
atom. Thus, the 13-orbitals basis comes to within ~ 3 mRy/atom of a totally 
converged energy. Also, the table illustrates the dependence on Kmt- the lattice 
parameters vary weakly and the total energy varies by ~1 mRy/atom when 
Rmt is changed from -10 to 10 percent of touching for the minimal basis; the 
dependence is slightly weaker with a large basis. 

Next, we examine in some detail the behavior of the (Al,Ga,In)N compounds, 
because it involves several useful tests of the method’s validity. The deep Ga 
and In d levels must be included in the valence for a well-converged calculation. 
(Pscudopotential results that pscudizc the d eleetrons differ considerably from 
those that do not.) Even though the states are deep, it turns out that they must 
treated carefully. For example, to produce accurate heats of mixing of the GaInN 
alloy it was found that two sets of Ga and In d orbitals were required. 

Our next case concerns the [111] y-surface of Ni. The [111] glide planes are 
connected with dislocation motion, and therefore are of significant practical inte- 
rest. To model the 7 surface with the periodic boundary conditions necessitated 
by the present computational approach, we employ a supercell of 8 [ 111 ] planes 
of Ni atoms, with the fault between the fourth and fifth planes. The first and 8 th 
layers were separated by a vacuum of two layers. Lattice vectors in the plane of 
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Table 2. Crystal structure properties of Se. Lattice parameters a and c are in atomic 
units, as are intrachain bond length di and interchain bond length d 2 ; u is an internal 
displacement parameter as described in Ref. [19] and E is the binding energy relative 
to the free atom, in Ry 





a 


c 


u 


d\ 


d.2 


E 


“, Rn,t=2.11 


7.45 


9.64 


0.258 


4.63 


5.83 


-0.7121 


“, Rmt=2.32 


7.40 


9.57 


0.266 


4.67 


5.75 


-0.7147 


“, Rmt=2A5 


7.40 


9.57 


0.267 


4.68 


5.73 


-0.7152 


“, Rn,t=2.58 


7.40 


9.59 


0.268 


4.69 


5.72 


-0.7149 


^ Rn,t=2.11 


7.40 


9.58 


0.259 


4.61 


5.78 


-0.7152 


^ Rmt=2.32 


7.41 


9.61 


0.264 


4.66 


5.76 


-0.7167 


=, Rmt=2.32 


7.42 


9.57 


0.264 


4.67 


5.77 


-0.7261 


^ R,nt=2.32 


7.38 


9.60 


0.270 


4.71 


5.70 


-0.7131 


% Rmt=2.32 


7.40 


9.59 


0.267 


4.72 


5.72 


-0.7131 


Ref. [19] 


7.45 


9.68 


0.256 


4.61 


5.84 




Experiment 


8.23 


9.37 


0.228 


4.51 


6.45 





“ 13 orbital basis 
**21 orbital basis 
21 +/ orbital basis 
lp=0 

" la=2 lp=2 



the fault were [110] and [112], so that each layer had two atoms. The total energy 
was calculated as a function of the translation r along [112]. It was assumed that 
the four layers separating the fault and the vacuum was large enough that there 
was no coupling between the fault and the surface. The thickness of the vacuum 
itself was checked and the energy shown in Fig. 7 change was negligible when the 
vacuum thickness was doubled, la and 1^ were taken to be 3, to be 0.4xi?„jt. 

Fig. 7 shows the energy of the Ni 7 surface, without allowing any atomic 
relaxation. It was calculated using a 9-orbital minimal basis of spd orbitals of 
energy -0.1 Ry and smoothing radius Rgm 2.0 a.u. and 1.0 a.u. for sp and d 
orbitals, respectively (dark circles), and then repeated for a basis enlarged by 
an spd set of energy -1 Ry, i?sm=l-3 a.u. (light circles). As the Figure shows, 
the addition of the larger basis made a 5% correction to the barrier. The BZ 
was integrated by sampling, using 27 irreducible points. A conservative mesh of 
15x24x120 divisions was used, corresponding to a mesh spacing of ~0.31 a.u.; 
the inset in Fig. 7 shows the error in the barrier as a function of the mesh spacing. 
It is seen that an acceptable error was obtained for a mesh spacing of ~0.5 a.u.. 
This corresponds to an error in the numerical integration of kinetic energy of 
~ 10^® Ry. 
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Table 3. Lattice properties of Ill-nitrides. Lattice constant a is in A. All data are 
for the wurtzite structure, except for rows marked a (ZB). Data marked OLD-FP are 
calculations from the present work, using the LDA and the exchange-correlation func- 
tional of Barth and Hedin [34] . Data marked GGA-LMTO employ gradient-corrected 
functional of Perdew [35]. Data marked PWPP are LDA results using a plane- wave 
basis and a pseudopotential with the Ga 3d and In 4d in the valence [33]. AEzb-wz 
is the ZB-WZ energy difference per atom pair, in meV 







AIN 


GaN 


InN 


Small basis 


a 


3.096 


3.163 


3.525 




cja 


1.600 


1.629 


1.606 




u 


0.3818 


0.3775 


0.3805 




a (ZB) 


4.353 


4.481 


4.961 




AEzb-wz 


54 


14 


23 


Large basis 


a 


3.095 


3.160 


3.509 




cja 


1.601 


1.629 


1.613 




u 


0.3818 


0.3766 


0.3794 




a (ZB) 




4.478 


4.947 




AEzb-wz 


51 


12 


18 


Old-FPLMTO 


a 


3.091 


3.160 


3.528 




cja 


1.602 


1.626 


1.611 




u 


0.381 


0.377 


0.380 




a (ZB) 


4.345 


4.464 


4.957 


LAPW'" 


a 


3.098 


3.170 


3.546 




c/a 


1.601 


1.625 


1.612 




u 


0.382 


0.377 


0.379 




a (ZB) 


4.355 


4.476 


4.964 




AEzb-wz 


45 


11 


21 


PWPP-'^ 


a 


3.084 


3.162 


3.501 




cja 


1.604 


1.626 


1.619 




u 


0.381 


0.377 


0.378 




a (ZB) 


4.342 


4.460 


4.932 




AEzb-wz 


36 


22 


20 


Experiment 


a 


3.11'* 


3.190“* 


3.544" 




cja 


1.601'* 


1.626“* 


1.613" 




u 


0.382“* 


0.377“* 






a (ZB) 


4.38 “ 


4.52-4.55*“ 


4.98" 



“ Ref. [24] 

^ Refs. [25-29], taken from Ref. [23] 

" Ref. [22] 

Ref. [30] 

" Ref. [21] 

^ Structural properties taken from Ref. [33]; AEzb-wz taken from Ref. [20]. 
® Taken from Ref. [31] 

^ Taken from Ref. [32] 
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Fig. 7. The total energy for a 7 surface of [111] Ni, as as function of translation 
r/4[112], as described in the text. Upper panel: total energy (mRy/unit cell) for a nine- 
orbital basis (dark circles), an 18-orbital basis (light circles) and for a pseudopotential 
calculation (dotted line), using VASP. Inset: circles show error in the peak energy 
(mRy) as a function of mesh spacing in a.u.. Crosses show corresponding error in 
numerical integration of the kinetic energy of the d orbital on the mesh. Lower panel: 
X (circles) and y (squares) force on the atoms at the interface. Dotted lines show VASP 
result. 



These results were compared to a pseudopotciitial caleulatioii using Van- 
derbilt’s [6] ultrasoft potentials; this latter calculation was performed with the 
VASP code [36]. The VASP code used the same BZ integration, and the default 
mesh (16x24x108 divisions). Those results are illustrated by the dotted lines 
in Fig. 7. There is good agreement with VASP throughout except for a modest 
discrepancy for t 1. The source of the discrepancy is not clear; it would .seem 
that in the all-electron case, the remaining sources of error (truncation in basis, 
truncation in /, smoothing of the core) all appear to be controlled. 
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4 Summary 

In this paper, we have presented a newly developed full-potential LMTO method 
with the following properties: 

• The potential is treated without any shape approximation, based on a mufhn- 
tin type of geometry consisting of non-overlapping atomic spheres and an 
interstitial region. 

• Empty spheres are not needed for any system. 

• The results can be systematically improved by increasing the convergence 
parameters. 

• The forces are as accurate derivatives of the total energy. 

Similar to some previous FP-LMTO approaches, the interstitial density and 
potential are represented on a real-space mesh while the wavefunctions are as- 
sembled from atom-centered basis functions with well-defined angular momenta. 
In order to reduce the basis size further and to make the evaluation of integrals 
more efficient, we have introduced a new type of function which plays the role of 
the smooth envelope. These “smooth Hankel functions” are analytic in all parts 
of space and equal the standard Hankel functions at large radii. Closer to the 
origin, they bend over smoothly and are nonsingular. To introduce atomic detail 
into these envelope functions, a modified augmentation scheme was employed 
which combines some features of the projector-augmented wave method and the 
pseudopotential approach. By systematically using additive augmentation for 
the density, the potential, and the wavefunctions, the approach allows low an- 
gular momentum cutoffs throughout. Tests were performed to assess the quality 
of the method. Results have shown that state-of-the-art convergence can indeed 
be attained with low angular momentum cutoffs and small basis sets. Since the 
forces are calculated and the potential is treated without any shape approxi- 
mation, the method presented here can be used to relax atomic structures and 
to study the energy changes associated with perturbations of the atoms around 
their equilibrium sites. 
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Abstract. The essential features of a full potential electronic structure method using 
Linear Muffin-Tin Orbitals (LMTOs) are presented. The electron density and potential 
in the this method are represented with no inherent geometrical approximation. This 
method allows the calculation of total energies and forces with arbitrary accuracy while 
sacrificing much of the efficiency and physical content of approximate methods such as 
the LMTO-ASA method. 



1 Introduction 

This paper describes a particular implementation of a full-potential electronic 
structure method using Linear Muffin-Tin Orbitals (LMTO’s) [2,10] as basis 
functions. There have been several “FP-LMTO” implementations [3—7]. The one 
described here has not been published in detail, although calculations performed 
with this method have been reported for quite some time [3]. There are many 
aspects to an electronic structure method. This paper is focussed on those aspects 
which enable a full potential treatment. Relatively small details pertaining to 
full-potential methods will be discussed while larger details having to do with, 
for example, relativity will not be. 

The emphasis of a variational full-potential method is somewhat different 
from that of a method such as the LMTO-ASA method. The emphasis of the 
former is on the completeness of the basis while in the latter it is in the phy- 
sical content (and interpret ability) of the basis. These concepts are, of course, 
intimately related, but the emphasis is different. 

The exposition here is for an inhnite system periodic in three dimensions. 
This method has been implemented for two-dimensional systems, [8] but that 
will not be discussed here. 

Notation 

Papers on electronic structure methods unavoidably carry a high overhead in 
functional symbols and indices. It is simplest to define here, without motivation, 
the special symbols and functions that will be used in this paper, for future 
reference. These special functions (although not necessarily the symbols used 
here) have been used extensively in LMTO documentation and are largely due 
to Andersen. [10] 

H. Dreysse (Ed.): Workshop 1998, LNP 535, pp. 148-167, 1999. 
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Spherical harmonics: 
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where K is a spherical harmonic. [9] 



Bessel functions: 
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where L denotes £m and rii and ji are spherical Neumann and Bessel functions, 
respectively. 



Geometry: For computational purposes, the crystal is divided into non-overlap- 
ping spheres surrounding atomic sites (mujfin-tin spheres) where the charge den- 
sity and potential vary rapidly and the interstitial region between the spheres, 
where the charge density and potential vary slowly. This is the muffin- tin geome- 
try used as an idealized potential and charge density in early electronic structure 
methods (KKR and APW). Here, the division is a computational one, and docs 
not restrict the final shape of the charge density or potential. In the muffin- 
tin spheres, the basis functions, electron density, and potential are expanded in 
spherical waves; in the interstitial region, the basis functions, electron density, 
and potential are expanded in Fourier series. 

There are many relevant considerations in choosing muffin-tin radii. Assu- 
ming all expansions are taken to convergence, the density and potential depend 
on the muffin-tin radii only through the dependence of basis functions on the 
radii. As discussed below, basis functions have a different functional form inside 
the muffin-tin spheres, and the choice of muffin-tin radius affects this crossover. 
Hence, assuming the Hamiltonian is the same inside and outside the spheres (the 
treatment of relativity may affect this as discussed below), the muffin-tin radii 
are variational parameters and the optimum choice minimizes the total energy. 
If the basis is large enough however (suitably complete within and without the 
spheres), the energy is insensitive to the choice of radii. A reasonable choice 
results from choosing radii that are within both the minimum in charge density 
and the maximum in potential along a line between nearest neighbors. Relati- 
vistic effects are usually taken into account only in the muffin-tin spheres, in 
which case the Hamiltonian depends on the radii; hence when relativistic effects 
are important, the radii are not variational parameters. 
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In what follows, lattice positions are vectors R — Rn, integer multiples of 
a basis R. Atomic positions in the unit cell are denoted by t. A set of atomic 
positions invariant under the point group of the lattice are said to be of the same 
symmetry type, t. Similarly, in the reciprocal lattice, vectors are g — Gn for the 
reciprocal basis G — 2ttR^'^ . Brillouin zone (or reciprocal unit cell) vectors arc 
denoted by k. 



Symmetric functions: Within the muffin-tin region, functions invariant are 
expressed in harmonic series. If /(r) is such a function, at site t 
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(8a) 
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In Equation (8a), is a transformation to a coordinate system local to site r; 
the local coordinates of sites of the same type are related by an element of the 
crystal point group that takes one site into another. Expressed in this way, the 
functional form of Dht (Equation (8b)) depends only on symmetry type. 

In the interstitial region, symmetric functions are expressed in Fourier se- 
ries: 
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The sum in Equation (9a) is over symmetry stars S of the reciprocal lattice. 



2 Basis Set 



2.1 Interstitial 



In the interstitial region (symbolically I) between the muffin-tin spheres, bases 
are Bloch sums of spherical Hankel or Neumann functions: 









(P,. (r-r,-it)) 
R 



( 10 ) 



The rotation Vr in (10) takes the argument into a coordinate system local to 
each site r. The purpose of this will be made evident later. The function on the 
right hand side of Equation (10) is sometimes called the envelope function. 

Notice the parameters, specifying a basis function, inherent in this definition. 
They are the site r in the unit cell on which the spherical wave is based, the 
angular momentum parameters £ and rn of the spherical wave with respect to its 
parent cell, and the kinetic energy of the basis in the interstitial region. The 
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angular momentum parameters specifying the basis set are chosen to represent 
the atomic states from which crystal eigenstates arc derived. In the LMTO- 
ASA, it is usual to include I bases one higher than the highest relevant band. 
In the method described here, this is rarely necessary, possibly because of the 
multiplicity of bases with the same angular momentum parameters. It is usual 
to use “multiple k” basis sets, having all parameters except the tail parameter 
the same. 

There appears to be no simple algorithm for choosing a good set of interstitial 
kinetic energy parameters. Schemes such as bracketing the relevant energy spec- 
trum have been proposed. [7] The optimum set would minimize the total energy. 
This can be done but is time consuming even for relatively simple systems. It 
seems, however that parameter sets obtained in this way for simple systems in 
representative configurations can give good results when used for related sy- 
stems over a broad pressure range. Thus good sets are arrived at through some 
experimentation. The choice can be important as it’s possible to pick a set of 
parameters that will give very bad results, and the parameter set used in any 
new calculation should be always checked for stability. 



2.2 Muffln Tins 



In the muffin-tin spheres, bases are linear combinations of spherical waves mat- 
ching continuously and differentiably to the envelope function at the muffin-tin 
sphere. The envelope function Af may be expanded in a series of spherical Bes- 
sel functions about any site except it’s center. A basis function on a muffin-tin 
sphere in the unit cell at i? = 0 is therefore 
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( 11 ) 



where r^- = r — t and B is equivalent to the KKR structure constant. [10] The 
unitary transformation applied to B rotates components into site-local coordi- 
nates from the left and right. 

Equation (50) is compactly expressed by defining a two-component row vector 
K so that 



= {lCt{n,T),Ji,{K,r)) 



and a two component column vector S so that 
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Then the value of a basis function on a muffin-tin boundary is expressed simply 
as 







yL{'Drrr)Ke{Ki, t-t', k) (14) 

L 



The radial part a basis function inside a muffin-tin sphere is a linear combi- 
nation of atomic like functions (j) and their energy derivatives (j) [10,2] matching 
continuously and differentiably to the radial function K in Equation (14). Collec- 
ting (j) and 0 in a row vector 



U(e,r) = {(f>{e,r),^(e,r 



a simple case of this matching condition may be expressed as U(e, s)17(e, k) — 
K(k, s) and U'(e, s)f2{e, k) = K'(k, s), where 1? is a matrix of order 2. 

The use of these radial functions in the method described here is different 
than that used by most other methods, however. For the broadest utility, a basis 
set must be flexible enough to describe energy levels derived from atomic states 
having different principle quantum numbers but the same angular momentum 
quantum number. For example, describing the properties of elemental actinides 
at any pressure requires a basis with both 6p and 7p character. Similarly, an 
adequate calculation of the structural properties of transition metal oxides re- 
quires both semi-core and valence s and p states on the transition metal ions. 
The description of the evolution of core states from localized to itinerant un- 
der pressure also requires multiple principle quantum numbers per £ value. It is 
usual in LMTO-based methods to perform calculations for the eigenstates and 
eigenvalues of “semi-core” and valence states separately, using a different basis 
set, with a single set of energy parameters {e^}, for each “energy panel”. This 
approach fails when energy panels overlap, and has the disadvantage that the 
set of eigenvectors is not an orthogonal set. The problem of “ghost bands” also 
arises. [2] 

In the method described here, bases corresponding to multiple principle quan- 
tum numbers are contained within a single, fully hybridizing basis set. This is 
accomplished simply by using functions (p and 4> calculated with energies {e„^} 
corresponding to different principal quantum numbers n to describe the radial 
dependence of a basis in the muffin-tin spheres. The Hamiltonian matrix for 
an actinide, for example, will have elements [V'Tp) the overlap ma- 
trix elements We may formally express the radial part of basis i in 

a muffin-tin sphere by the function /(r) = ai(nI:)U(e„£, s)l7(e„^, /ti) but in 

practice it is sufficient to restrict the coefficients by ai{n£) = 6{n,ni) so that 
the basis set (although not eigenvectors) will have pure principal quantum num- 
ber “parentage” . This method of expanding the energy range of a basis set has 
been used (and reported) extensively. Representative calculations in which this 
method was essential are described in Reference [11]. 

Thus another parameter specifying a basis function is the set of energy para- 
meters {ete} that will be used to calculate the radial basis functions (pte and pu 
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used to express the basis function in muffin-tin spheres of each symmetry type. 
A basis function in a muffin-tin splierc is tficrcfore 
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(15) 



where means “use the energy parameter corresponding to the principal 
quantum number specified for basis i” and 



Uti(e,r) = yL{r)\Jte{e,r) 



(16) 



The necessary cutoff in angular momentum has now been made explicit. The 
2x2 matrix 17 matches U to K continuously and differentiably at the muffin-tin 
radius. Specifically, 17 is specified by 

{Me, St) Me,-n)'\ ^ / s, _ flCe{K,st) Je{K,st)\ 

U.(e,s,) ^ ^ JMst)J 

In principle, and as programmed, each (t£k) basis can use its own unique 
energy set. It is more usual to use a common energy set for a set of basis states 
giving rise to bands of similar energy within the scope of a particular calcula- 
tion. The configuration of the basis shown in Table 1 for example uses a set of 
energies for “semi-core” 6,s and 6p bases, and another set of energies to repre- 
sent “valence” bases. The calculation of energies in an energy parameter set is 
discussed below. 

A parameter introduced in (15) is the angular momentum cut-off £m- In most 
cases, a converged total energy is achieved with values £m ~ 6 — 8. Note that 
since a basis set generally contains functions based on spherical waves with £ < 3, 
the KKR structure constant in (13) is rectangular. 



Table 1. Parameters for typical basis set for an elemental actinide: parent angular 
momentum parameter (£), energy set for radial expansions (e-set), and the index of 
the kinetic energy in the interstitial region) k- index). A typical set of values, corre- 
sponding to the kinetic energy indices, is given at the bottom of the table. 
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3 Matrix Elements 



3.1 MufRn-Tin Matrix Elements 



The potential in a muffin-tin at r has an expansion in linear combinations of 
spherical harmonics invariant under that part of the point group leaving r in- 
variant: 



) ='^Vht{rT)Dht(Vrrr) 

'rr<St 
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(18a) 


Dhtif) = . 


(18b) 



m 



The utility of referring bases and potentials in muffin-tin spheres to site-local 
coordinates is apparent in (18a). If the site local coordinates of sites are con- 
structed so that V-r' = VrQ^^ for some Q such that Qt = r', then the harmonic 
functions Dht depend only on the symmetry type, rather than on each site. The 
normalization for the spherical harmonic in (18a) (C = 47 t/( 2£+1)3^) is chosen 

so that Vht{r) is the potential when £h = 0- 

Combining (15) and (18a), the potential matrix is 



{'ipi\V\^j) 
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The matrix element of the Dht is a sum over Gaunt coefficients: 



{L\Dht\L') = '^aht{mh)G{£', in'; £, m; 4, mh) 

fTih 

g{£', m'; £, m; £h, ruh) = y 



In electronic structure methods using muffin-tin orbitals, the muffin-tin energy 
parameters {e^ are usually taken from “Gprojected average energies” . With mul- 
tiple energy sets, this is a reasonable choice provided that the basis set, which 
uses separate sets, gives rise to bands well separated in energy. The ^-projected 
charge, integrated over a muffin-tin sphere, is a sum over cross terms between 
energy sets 



ij 

and must be made diagonal in some approximation for the resulting energy- and 
^-projected energies and charges to be representative. 
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Another criterion, particularly useful for states using different sets not well 
separated in energy or for states not having signiheaiit occupation is to maximize 
the completeness of the basis. To accomplish this, the energy parameter for the 
low energy state ei{\) can be set to a set of projected energy averages, and the 
energy parameters for the same £ in higher energy sets may be chosen so that 
the radial function has one more node and the same logarithmic derivative at 
the mufhn-tin radius, hence 

r^dr (pe{ei,r)(f>e{ei,r) = 0 , i>l . (20) 

Although this usually generates energy parameters out of the range of occupied 
states (since the logarithmic derivative of semi-core states is usually large in 
magnitude and negative), this choice seems to give a total energy close to the 
minimum with respect to this parameter. This is an example of the difference 
mentioned in the introduction in emphasis between an accurate “basis-set” me- 
thod and a method motivated by a physical model. 

The convergence of the harmonic expansion of the potential in a muffin-tin 
sphere (18a) depends, of course, on the basis, atomic constituents, and geometry. 
Using harmonics through = 6 is usually sufficient, and it has never been 

necessary to go beyond Ihrr^ax ~ 




3.2 Interstitial Matrix Elements 

Overlap and Kinetic Energy: The interstitial overlap matrix can be easily 
obtained from an integral over the interstitial surface (the only non-zero contri- 
butions, in a crystal periodic in three dimensions, come from the surfaces of the 
mufhn-tin spheres) and the kinetic energy is proportional to the overlap: 



tpl{r)tPj{r) 
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where W (/, g) — fg' — f'g. Basis functions on mufhn-tin spheres are given in 
(14), hence 
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In the limit Kj k?, the evaluation of (21) requires the derivative with respect 
to of the structure constant. 
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Potential Matrix Elements: The greatest difference between LMTO-based 
full-potential methods is in the way the matrix elements of the potential are 
calculated over the interstitial region. The method being described here uses a 
Fourier representation of basis functions and the interstitial potential to caleu- 
late these matrix elements. Other approaches for computing these elements are 
described in the literature. [4,5] 

A Fourier transform of the basis functions described in Section 2 would be 
too poorly convergent for practical use. However, the evaluation of the inter- 
stitial potential matrix requires only a correct treatment of basis functions and 
potential in the interstitial region. This degree of freedom can be used to design 
“pseudo basis-set” , equal to the true basis in the interstitial region although not 
in the muffin-tin spheres, and have a Fourier transform which converges rapidly 
enough for practical use. We define this pseudo basis set by 



ipi{k,r) 



rex 



ICi{K,r) 



R 

lCi{K, r), r > s, s < Sr 



(23a) 

(23b) 



Since rapid Fourier convergence is the criterion for constructing the pseudo- 
basis, it is useful to consider the Fourier integral of a Bloch function with wave- 
number k: 

Hot = i, ( 24 ) 

where Vc is the unit cell volume. Equation (24) is obtained by casting 
on the plane wave then doing two partial integrations; surface terms vanish due 
to periodicity. From (24) it is evident that the Fourier integral of a pseudo-basis 
satisfying the hrst criterion (equal to the true basis in the interstitial region) may 
be obtained from integral over muffin-tin spheres. If in addition, the pseudo-basis 
is different from a Hankel function only in it’s parent sphere, the Fourier integral 
is a finite integral over a single muffin-tin sphere. The problem then is to find 
a function ip such that {V‘^+K^)ip has a rapidly convergent Fourier integral, 
vanishes outside a radius less than or equal to the parent muffin-tin radius for 
the basis, and has a value and slope equal to JC at this radius. 

A good choice for such a function is obtained by solving 



+ k'^) ICe{K,r)yL{r) = -cey-j 1 ~ j yL{r)0{s - r) 



(25) 



for a radius s < s'<^, and with with C£ chosen to match on to A at s. This is 
easily done analytically. The resulting Fourier transform is 



A{k+g) 



4^ yL.(fc+g)e-’(^+g)-^ £^ JN{\k+g\,s) 

Vc (|fc + C/|2-Kf) ^ JN{Ki,s) 



(26) 



where N = £i+rii + l. The subscript i has been purposely left off N and s (see 
below) . 
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These coefficients converge like provided J’N{\k+g\, s) achieves it’s 

large argument behavior, and n can be chosen to optimize convergence. Weinert 
[12] used an analogous construction as tool to solve Poisson’s equation. He pro- 
posed a criterion for the convergence of the Fourier series (26) which amounts to 
choosing the exponent n in Equation (26) so that \k+gmax\s would be greater 
than the position of the first node of We find this criterion to be useful 

provided anisotropy in reciprocal space is accounted for. This is accomplished by 
using the minimum reciprocal lattice vector on the surface of maximal reciprocal 
lattice vectors, rather than simply using Qmax- 

Notice that this criterion is a criterion for N — £+n+l. The basis Fourier 
components are simplified, and the amount of information stored reduced, by 
simply using a single argument for all bases; i.e. all bases use the same value of N . 
It is also possible to use a single radius s, less than or equal to the smallest muffin- 
tin radius, since the only requirement is on the pseudo bases in the interstitial 
region. In practice, a few radii are desirable if large and small atoms are present in 
the same calculation, since small radii give less eonvergent Fourier coefficients. 
In any event, no more than a few radii arc necessary to handle systems with 
many atoms. Notice also that local coordinates have been left out of (26). The 
resulting potential matrix may be easily rotated to local coordinates at the end 
of the calculation. 

As expressed in (26), the Fourier components are products of phases 
g-i(fe+g)-x^ which scale like the number of atoms squared (the size of the reci- 
procal lattice grid grows linearly with the number of atoms), and a function of 
lattice vectors and a few parameters, which scales linearly with the number of 
atoms. The phase factors are simple to calculate by accumulation and need not 
be stored. 

The potential in the interstitial region is similarly obtained from a “pseudo- 
potential” V that equals the true potential in the interstitial region and has 
rapidly converging Fourier coefficients: 



V{r) = V{r) 



V{r) ^J2^iS)Ds{r) 



Ds 



ges 



(27a) 

(27b) 

(27c) 



The sum in Equation (27b) is over stars S of the reciprocal lattice. 

Integrals over the interstitial region are performed by convoluting the po- 
tential with an interstitial region step function and integrating over the unit 
cell: 



The potential matrix element is calculated by convoluting the convoluted po- 
tential with a basis, and performing a direct product between convoluted and 
unconvoluted bases. If basis functions are calculated v? reciprocal lattice vectors. 
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the interstitial potential will be calculated on (2n)^ vectors. The convolution is 
exact if it is carried out on a lattice containing (4n)^ vectors. The size of the 
set of reciprocal lattice vectors necessary to converge the total energy using 
this treatment of the interstitial region varies from between ^ 150 — 300 basis 
plane waves per atom, depending on the smoothness of the potential and the 
convergence required. 

Another way of integrating over the interstitial region, more usual in site- 
centered methods, is to integrate Fourier series over the unit cell and subtract 
the mufhn-tin contributions with pseudo-bases and pseudo-potential expressed 
as an expansion in spherical waves. The convolution has an advantage in acting 
with a single representation, and, given a finite representation for bases and 
potential, the convolution may be done exactly. 

Empty spheres are never used with this scheme. Bases, and the charge den- 
sity and potential are calculated as accurately as necessary using the scheme 
described above and a basis set expanded with tail parameters and energy sets 
has proven to be flexible enough to accurately describe the contribution of the 
electronic states in the interstitial region. 



4 Charge Density 



When a solution to the wave equation at every physical energy is available, the 
charge density may be obtained from a set of energy-dependent coefficients. The 
spherically symmetric charge density in a muffin-tin sphere, coupled with an 
i — projected density of states, is an example. In a variational calculation, as is 
being described here, all that is available is a (variational) solution to the wave 
equation at a set of discreet energies, and the charge density must be obtained 
simply from the square of the eigenvectors, or equivalently from expectation 
values of occupation numbers. 

Having calculated a set of eigenvalues and eigenvectors A of the generalized 
eigenvalue problem, the charge density in the interstitial region is 



n{r) I 
b(5) 



J2n{S)Ds{r) 

s 

[ d^re-^s"^\Y,Mk,r)Ai{nk) 

ges nk , 



(28a) 

(28b) 



where Ng is the number of vectors in the reciprocal lattice star S. The square of 
the wave function is obtained by convoluting the Fourier components of ip with 
A, Fourier transforming, and taking the modulus. 
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In the muffin-tin spheres the charge density is 

nht{rr)Dht(Prrr) (29a) 

Vr<St ^ 

h 

nht{r) = '^'^Uu'{ei',r)Mht{e£,e’l')U'^^{ei,r) (29b) 

ee e'V 

Mht{e£,e't') = alt{mh)G{£, m; m'; ih, rnh) (29c) 

mhmm' 

X y~^ WnkVrlm{^)'^\iim'i^ ) 

nk 

^r£m(^) (^; ^2 ) ^ (29d) 

i 

The process of calculation is evident in the sequence of equations. 

5 Core States 

Core states, even spherically symmetric complete shells, contribute non-miiffin- 
tin components to the interstitial region and to muffin-tin spheres surrounding 
other sites. Whether it is essential to include this contribution depends on the 
size of the contribution, and any sizable contribution implies that there are states 
being treated as localized which aren’t localized within the scope of the calcula- 
tion. Nevertheless, confining states to the core is often useful, and including the 
core contribution to the full potential is not difficult. One possibility, the one 
used in this method, is to fit the part of the core electron density to a linear 
combination of Hankel functions, and expand this density in the interstitial re- 
gion as a Fourier serie and in the muffin-tin spheres in a harmonic series, in the 
same way the basis functions are treated. 

6 Potential 

6.1 Coulomb Potential 

The Coulomb potential is obtained by first calculating the Coulomb potential 
in the interstitial region, then, using the value of the interstitial potential on 
the muffin-tin sphere, calculating the potential in the spheres by a numerical 
Coulomb integral of the muffin-tin electron density for each harmonic. 

The interstitial Coulomb potential is calculated in a way similar to that 
suggested by Weinert [12]. Express the electron density as 

n(r) = n(r) + y^(n(r) - n(r))0(s< - rv) (30) 

Rt 

where h is the squared modulus of the pseudo-eigenvectors, which is equal to 
the true electron density in the interstitial region. The first term on the right- 
hand side of (30) has, by construction, a convergent Fourier series. The second 
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term is confined to muffin-tin spheres. To calculate the Coulomb potential in 
the interstitial region, this term may be replaced by any density also confined 
to the mufHn-tin spheres and having the same multipole moments. If a charge 
density satisfies these requirements and also has a convergent Fourier series, the 
Coulomb potential in the interstitial region may bo easily calculated from the 
combined Fourier series. Such a charge density can be constructed in a similar 
way to that detailed for the pseudo-bases. Construct a pseudo charge-density 
satisfying 



n(p){r) = n^P\ht,rR^)Dht{'Drrhr) (31a) 

Rt h 

= (31b) 

\ Sf / \ \ Sf / / 

0 = y d^r rlD’^^{'Drrr){h^^\r)—ri{r)+n{r)'j . (31c) 



This charge density has Fourier components 



i^ip) ( — 






-W-'Ti 



-if>'Dht{Vrg} 



47 t {Qht{n}-Qht{n}) 



r h 
X 



Vr. 









{2£h + 1)!! 

where the multipole moments Q are defined by 



Qht{n} = f Dht{r\)n{r) 

47T Jst>rr 



(32) 



(33) 



The Fourier components h^P'^{r) converge like provided attains 

it’s asymptotic form. The exponent n is chosen using the same considerations as 
for the pseudo-basis set. 

The Coulomb potential in the interstitial region is then given by 



Vc{r) 



X 



47re2(n(g)WPH5)) 



(34) 



From the Coulomb potential in the interstitial region follows the Coulomb 
potential on the surface of the mufhn-tin spheres. The coulomb Potential inside 
the mufhn-tin spheres is 



pC)(r) 



\rr<St 



^DhtiVrr-r) 

h 






fSt ATTr'^nhjr) 

3 2£h+l 

, 24+1 ■'Cl; 



(35) 
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where 



^ / _ drDU{V^f)V^^\r) (36) 

J r-i — Sf 

is the harmonic component of the potential on a sphere boundary. 



6.2 Density Gradients 



Gradients of the electron density are needed for the evaluation of gradient cor- 
rected density functionals. These functionals depend on invariants (with respect 
to the point group) constructed from density gradients {e.g. |Vnp). This re- 
duces computation significantly in the muffin-tin spheres, for if / and g are 
invariant functions {i.e. f{r) = fh[r)Dh{f‘)), and d = V/ • Vgi, then d{r) = 

Y^h^h{r)Dh{r) with 



47rr^ 

24+1 



Mr-) = Y. Y. 

h,h' — ±1 






(37) 



where the set of parameters I is easily calculable from 3j and 6j coefficients and 
integrals over the harmonic functions Dh, and 



Ak) _ 4n I rf - 4 k = l 
“ 2£+l jr/' + 4 + l k^-1 



(38) 



and similarly for g. 

Gradients of the interstitial charge density, represented as a Fourier series, 
are poorly represented by differentiating the series term by term. A stable repre- 
sentation of the density gradient that converges well is obtained by defining the 
derivative as the difference between adjacent grid points, divided by twice the 
grid spacing as suggested by Lanczos.[13] This is equivalent to differentiating, 
term by term, the Lanczos-damped series for the charge density. 

7 All-Electron Force Calculations 

7.1 Symmetry 

The set of internal forces acting on the atomic sites of a crystal is a symmetric, 
discrete function of atomic coordinates and has a spherical expansion on the 
crystal sites with the same coefficients as continuous symmetric functions (8a) 
and (8b). Since forces are vectors, their representation has ^ = 1, and if a site 
has no invariant harmonics with £ = 1, there is no force on that site. So the force 
on an atomic site may be expressed as 

fM= Y fhtY^MmUr 

h-.th = l 



m 



(39) 
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v/v 



exp 



Fig. 1. The deviation of the internal coordinates of rhombohedral BaTiOa from ideal, 
calculated using all-electron force calculations as a function of volume with both LDA 
(open symbols) and GGA (filled symbols) exchange-correlation functions. The grey 
filled symbols are experimental points[16]. The LDA equilibrium volume is .958 Ve^p; 
the GGA volume is 1.037 Yexp- The energy was also minimized with respect to the 
rhombohedral angle at each volume. 



where the coefficients a are as in (8b), the are spherical unit vectors, [14] and 
Ur is the transformation to local coordinates for spherical vectors. A force calcu- 
lation is, as much as possible, a calculation of the set {fht}] The size of this set 
is often much smaller than three times the number of atoms. The displacements 
of atoms allowed by symmetry also have the form of (39); 

St ^ ^ ^'^ht ^ ^ 
h:(h = l ™ 

Minimizing the energy with respect to the atomic positions is a process of 
finding the set {Srht} that gives fht = 0. 

7.2 Force Calculations 

The calculation of forces in an all-electron method has been nicely described 
by Yu et al. [15] for the LAPW method. In addition to the terms discussed in 
that paper, a force calculation using a site-centered basis has the additional, 
and significant, complication that the bases depend on atomic position not only 
through augmentation but also through parentage. 

The contributions to the total force on a site in an all-electron calculation 
follow directly from a derivative of the LDA total energy with respect to atomic 
positions. The terms listed by Yu et al. are 1) a “Helmann-Feynman” term, 
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dE/dr, which accounts for the explicit dependence of the energy functional on 
atomic positions, 2) an “Incomplete Basis Set” (IBS) term, which arises when 
derivatives of basis functions aren’t contained in the space covered by the basis 
set, 3) a core-correction term, arising because core states are calculated using 
only the spherical average of the potential, and 4) a muffin-tin term, a surface 
term arising from the change in integration boundaries when atoms are moved 
and the discontinuity of the second derivative of basis functions across muffin- 
tin boundaries. There arc two other terms to consider. The first arises when a 
calculation isn’t fully self-consistent, and has the form — fy (Vout — Vin)dn(r) /dr, 
where I4ut and V)n are output and input potentials. The second term arises 
from the way in which Brillouin zone integrals arc done. Whether by quadrature 
or linear interpolation, the result is a set of weights (occupations) multiplying 
quantities evaluated at discrete Brillouin zone points. The terms listed above do 
not take into account the change of weights with atomic positions. 

The evaluation of the IBS term in a method using site-centered bases is 
significantly more involved than in the LAPW method. This term has the form 

-Fibs = - ^ w'nfe ^ A>fc ( (ipi\E - enk\di/^j/dT) 

nk ij 

+ (dipi/dr\H - enk\ipj}^Aj,nk (41) 

where the A are eigenvectors. Both LAPW and LMTO methods have a depen- 
dence on atomic positions through augmentation (the expansion of the basis 
set in atomic-like spherical waves) in the muffin-tin spheres, and both methods 
have an implicit dependence of basis functions on atomic positions through self- 
consistency, a term largely ignored and usually negligible. A site-centered basis, 
however, depends on atomic positions also through it’s parent site (the site it’s 
centered on). The contribution from augmentation is fairly easily accounted for 
at the density stage of a calculation, after integrals over the Brillouin zone have 
been done. The parent contribution, however, requires evaluation at the part 
of the calculation where eigenvalues and vectors are obtained, which makes its 
calculation time consuming. 

There are four types of contributions to dip /dr: 

--^ipi{k,r) = + 6l*^'jipi{k,r) (42) 

S^^'^ipi{k, r) = 0{r e r) ptpiik, r) (43) 

6^^'>ipi{k,r) = 6{Ti,T)'^0{st>-rr')Vt'Liei,rr')f2t'£{ei,Ki) 
t' L 

X ( ° 

\-i'VrBL,Li {Ki,T'-Ti, k) 

5f'^ipi{k,r) = 0{st-‘i\)^^pEtL{ei,r^)Uu{ei,Ki)^L,LMi,T-Ti,k) (45) 

L 
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5[^'>'4>i{k,r) = -6>(,st-rv) ^UtL(ei,r^)l7t£(ei, Ki) 

L 

X ( ^ 

\-iVrBL,Li{K.i, T-Ti, k) 

where p is the momentum operator — iV. The first two terms, Equations (43) 
and (44), are parent terms, changes in a basis due to a change in the site the 
basis is centered on. The first term. Equation (43), is the derivative of the wave 
function in the interstitial region (Equation (10) with respect to its parent site. 
Since the gradient of a solution to the Helmholtz equation is a solution to the 
Helmholtz equation, matrix elements and — V^\p'tpj')^ are calcu- 

lated as integrals over the surface of the mufhn-tin spheres. As in Equation (22), 
when interstitial region tail parameters are the same, the evaluation requires 
derivatives of structure functions. Working out this contribution proceeds as in 
Equation (22), although arriving at a finite form requires identities such as 

'y ) ^ — 1 mi, — ^(^61 fe)f/ ( ^6“ 1, ^bi bUb- 1, M ) ^b 

~ + l mi, — ^)f/ ( ffe + l) bHb~l^y ^bi bblb, 1, 

rria -|-^,£l,mi, (^b> '^bi^')Q i^^ai ^ai -^oTl, ?Tl(j+p., 1, /X ) 

“ — 1 m„-|-^,£i,mi, (^6> fe)0 ( -^o, 'Tblai 1 , ?Tl(j+/X, 1 , /X ) 

(47) 

Potential matrix elements (^ipi\V are calculated using Fourier series as in 
Sect. 3.2 with gradients taken as discussed after equation (38). 

The second term, equation (42), is the analog of the first term in the muffin- 
tin spheres; i.e., this term is the derivative of a basis with respect to its parent 
site evaluated in the mufhn-tin spheres. This term requires the gradient with 
respect to atomic positions of the structure function B. This gradient is easily 
obtained from the structure function itself: 

d 

t; B^rn^C'm' ^) 

UU u—t — t' 

T-r', fc) 

g{e, m; £+1, m+p; 1, 

-K^g{£, m\ £-1, m+p; 1, ,„,(«:, r-r', fc)j 

T - xV 0 (48) 

If convergence with respect to £ on the left hand side of the structure function 
is sufficient for the energy, terms in £max + 1 in Equation (48) may be neglected 
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d{As-V) 




Fig. 2. Relaxation of a silicon 65 atom supercell containing a vacancy, a Si interstitial, 
and an As interstitial. Of the 106 internal coordinates in this cell, 104 were allowed to 
relax (2 coordinates were fixed to fix the center of mass of the crystal). The calculation 
used a simple Broyden’s method to zero atomic forces. 



in evaluating forces. As stated above, the evaluation of these terms is somewhat 
time consuming. 

Examples of the use of forces for structural relaxation are given in Figures 
1 and 2. Figure 1 shows deviations from ideal lattice positions calculated for 
rhombohedral BaTiOs as a function of volume compared to experiment [16]. 
The rhombohedral angle was also relaxed at each volume in this calculation. 
The Ti coordinate is a displacement along [111]. The oxygen displacements Ax 
are along face diagonals while Az is toward the cell center. These calculations 
included Ti 3s and 3p and Ba 5s and 5p along with the usual valence bases in 
a single, fully hybridizing basis. At convergence, forces on internal coordinates 
were less than 1 niRy/Bolir. Figure 2 is a calculation of structural relaxation of 
As-vacancy-interstitial complex in Si. To a sixty-four atom Si supercell was added 
an As impurity at a tetrahedral interstitial position and a Si interstitial at an 
exchange position both surrounding a vacancy. The crystal, far from equilibrium, 
was then allowed to relax. Two internal coordinates (of a total of 106) were fixed 
to fix the center of mass of the crystal. The energy was minimized with respect to 
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the other 104 internal coordinates by zeroing the forces (to with 1 mRy/Bohr). 
The forces were zeroed using a simple Broyden’s method. 



8 Conclusion 

In this article we have described our highly accurate full-potential LMTO me- 
thod for solving the Kohn-Sham equations. In particular, we have shown that 
by dividing the crystal space into non-overlapping “muffin-tin” spheres and an 
interstitial region, we can compute the charge density or the potential without 
any shape approximation, thus eliminating any need for empty spheres which 
are necessary in other LMTO implementations when the crystal is not closely 
packed. Another feature of our implementation is that we can describe multiple 
principle quantum numbers within a single, fully hybridized basis set. This is 
accomplished simply by using functions f and (f> calculated with energies {e„^} 
corresponding to different principal quantum numbers n to describe the radial 
dependence of a basis in the muffin-tin region. In the interstitial region our 
method uses “multiple k” basis sets, for a better description of the interstitial 
charge density. Highly accurate charge density can be obtained by systematically 
increasing tlie number of variational parameters k for eacli angular momentum 
of the basis set. 

The potential in a mufhn-tin sphere at t has an expansion in linear com- 
binations of spherical harmonics invariant under that part of the point group 
that leaves atomic positions invariant. The evaluation of the interstitial poten- 
tial matrix only requires a correct treatment of basis functions (and potential) 
in the interstitial region. We have used this degree of freedom to design “pseudo- 
basis functions”, equal to the true basis functions in the interstitial region and 
are smooth functions in the muffin-tin region, with the requirement that their 
Fourier transforms converge rapidly enough for practical use. 

The set of internal forces acting on the atomic sites of a crystal is a sym- 
metric, discrete function of atom coordinates and has a spherical expansion on 
the crystal sites with the same coefficients as continuous symmetric functions. 
The total force on a site is given by the derivative of the LDA total energy with 
respect to the atomic position. Our implementation of the forces is in many 
ways similar to that of Yu et al. for the LAPW method [15]. Because our basis 
set is a site-centered one, we are required to compute additional terms, which 
can be time consuming. These contributions to the forces are non existant in 
plane-wave based methods, such as the pseudo-potential method. In addition 
to the “Helmann-Feynman” term, which accounts for the explicit dependence 
of the energy functional A on atomic positions, the other contributions are: (1) 
an “Incomplete Basis Set” term, (2) a core-correction term, (3) a surface term 
arising from the change in integration boundaries when atoms are moved, (4) a 
term which arises when the calculation isn’t fully self-consistent, and (5) a term 
arising from the way in which the Brillouin zone integrals are performed. We 
have showed that the forces are accurate enough to relax atomic structures. As 
examples, forces have been used to optimize the internal coordinates of rhom- 




Full-Potential LMTO 



167 



bohedral BaTiOs as a function of volume and the geometry of a 65 atom As, 
vacancy, and interstitial defected Si supercell. Where experimental results are 
available, good agreement is obtained. 
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Abstract. The most popular electronic structure method, the linear muffin-tin orbital 
method (LMTO), in its full-potential (FP) and relativistic forms has been extended 
to calculate the spectroscopic properties of materials form first principles, i.e, optical 
spectra, x-ray magnetic circular dichroism (XMCD) and magneto-optical kerr effect 
(MOKE) . The paper describes an overview of the FP-LMTO basis set and the calcula- 
tion of the momentum matrix elements. Some applications concerning the computation 
of optical properties of semiconductors and XMCD spectra of transition metal alloys 
are reviewed. 



1 Introduction 

The density functional theory (DFT) of Hohenberg, Kohn, and Sham is the 
method of choice for describing the ground-state properties of materials [1]. 
However, in the initial derivation of the DFT, the eigenvalues are Lagrange mul- 
tipliers introduced to orthogonalize the eigenvectors, which in their turn are 
used to compute the total energy and the charge density. In this formulation 
the eigenvalues have therefore no physical meaning and should not be conside- 
red as excited states. Nevertheless, the DFT in the local density approximation 
(LDA) or in its spin resolved local density formulation (LSD A), has been used 
successfully to compute the excited states, namely, optical and magneto-optical 
properties, x-ray absorption and magnetic dichroism spectra. 

The LDA or LSDA were indeed intended to compute the ground-state pro- 
perties of materials, and their use during the last two decades has produced an 
excellent track record in the computation of these properties for a wide variety of 
materials, ranging from simple metals to complex semiconductor superlattices. 
However, it is now believed that the DFT can do more than computing the gro- 
und state properties. This is because the Kohn-Sham equations could be viewed 
as deriving form a simplified quasi-particle (QP) theory where the self-energy is 
local and time averaged, i.e., S{r,r',t) » I4c(r)(i(r — r')S{t), here I4c(r) is the 
local exchange and correlation potential as, for example, parameterized by Von 
Barth and Hedin [2]. Viewed in this way, the KS eigenvalues are then approxi- 
mate QP energies and could be compared to experimental data. This argument 
is supported by quasiparticle calculations within the so called GW approxima- 
tion of Hedin [3] showing that the valence QP energies of semiconductors are in 
good agreement with these obtained using LDA, and the conduction QP energies 
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differ by approximately a rigid energy shift [4,5], In the literature this shift is 
often called “scissors-operator” shift [6]. 

In the last few years spectroscopy is becoming the standard tool for measuring 
excited states of materials. Its owes its impressive advances to the availability of 
synchrotron tunable highly polarized radiation. In particular, the measurement 
of optical, magneto-optical properties as well as magnetic x-ray dichroism are 
now becoming routine tasks for probing the structural and magnetic properties of 
materials. Considerable attention has been focused on transition-metal surfaces 
and and thin hlms due to their novel physical properties different from that of 
bulk materials and due to potential industrial applications such as magneto- 
optical recording, sensors, or technology based on giant magneto-resistance. In 
this respect, theory is falling far behind experiment and it is becoming hard to 
give a basic interpretation of experimental data. 

This paper, which is far from being a review paper about calculated excited 
states, tries to bridge the gap between experiment and theory by describing a rat- 
her quantitative method for computing excited states of materials. This method 
uses the local density approximation and the linear muffin-tin orbital (LMTO) 
method. In the first part of this paper we introduce the density functional theory 
and the local density approximation and justify the use of LDA eigenvalues as ap- 
proximate excited states and relate them to quasiparticle energies. In the second 
part we describe the construction of the LMTO basis set within an all-electron 
full-potential approach [7,17] which will be used to determine the momentum 
matrix elements. We devote the third part to the determination of the momen- 
tum matrix elements. In the first part of the application section we present some 
examples of computation of semiconductors optical spectra [9,10], and leave out 
the optical properties of metals and magneto-optical properties of materials and 
refer the reader to Ref. [11,12,9,14-19]. In the second part of the applications 
we show some examples of x-ray magnetic dichroism calculations [20-22] . 

2 Density Functional Theory 

The density functional method of Hohenberg and Kohn [1] which states that the 
ground state total energy of a system of N interacting electrons in an external 
potential I4xt is a functional of the electron density p(r) does not provide an ana- 
lytical form of the functional [1]. This method remains numerically intractable 
without the Kohn and Sham introduction of the so called local density approxi- 
mation [1] in which the exchange and correlation functional appearing 

in the total energy: 



i?{n} = T{n} EjJd^rJ + E.^n} 

+ J dd- Kext(r)n(r) + £(I4xt) 



( 1 ) 
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is given by E^dn} = f dPr ea;c('n(r))n(r) whore e^c is the oxchango-eorrclation 
energy of a uniform electron gas of density n. Thus, Kohn and Sham constructed 
a set of self-consistent single-particle equations: 



(^-V^ + - T)'jijj{r) = eiV’j(r) 

where the density n(r) is given by: 



n(r) = ^6>(ei < EF)i^i{Y)%j}\{Y) 

i 



and 



v;xt(r) 




(2) 



( 3 ) 

( 4 ) 



B(yext) = e^EE(l - ^ (5) 

tR t'R' ' 

Instead of the true kinetic energy of the electron gas, Kohn and Sham used 
the homogeneous electron kinetic energy: 

T = E j dd (6) 

i 

This use of homogeneous-electron kinetic energy in the Kohn-Sham equations 
redefined the exchange-correlation function to be: 

Exc{n} = {Exc{n} + T{n} -T) = J d^r £a;c(n(r))n(r) (7) 

It is then crucial to use a good basis-set for the description of the electro- 
nic structure of realistic systems. The augmented plane wave [23] (APW), and 
the Korringa-Kohn-Rostoker [24] (KKR) methods can be used, in principle, to 
solve exactly the Kohn-Sham equations, however these methods are numerically 
involved and their linearization, introduced by Andersen is much preferable. 
Andersen linearization, has not only made the techniques for solving the band- 
structure problem transparent by reducing it essentially to the diagonalization 
of one-electron Hamiltonian, and cuts the cost of computation by at least one 
order of magnitude. The linearized versions of these two powerful methods arc 
the linear augmented plane wave (LAPW) and linear muffin-tin orbital (LMTO) 
methods, respectively [17]. 

In this paper, we will only use the LMTO method to study excited states 
of solids. The reason for this choice is that the LMTO method is the mostly 
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used method in computational electronic structure. This is due primarily to the 
use of atomic-sphere approximation (ASA) which made the LMTO method run 
fast even on today’s cheap personal computers. Due to this reduced computa- 
tional cost, the LMTO ASA method became the method of choice of researchers 
without access to supercomputers. 

3 Quasiparticle Theory and Local-Density Approximation 
Link 

The quasiparticle (QP) electronic structure of an interacting many-body system 
is described by the single-particle eigenstates resulting from the interaction of 
this single particle with the many-body electron gas of the system. The single- 
particle eigenstate energies are the results of solving a Schroedinger like equa- 
tion containing the non-local and energy-dependent self-energy instead of the 
exchange-correlation potential appearing in Kohn-Sham like equations: 



(T+Pff + Kxt)'f(r) + I d^r'S{r,r\E)i'{r') = Eir{r). 

Thus the self-energy E contains all many-body effects. Almost all ab-initio 
QP studies were performed within the so-called GW approximation, where the 
self-energy E is calculated within Hedin’s GW approximation. This method 
consists of approximating the self-energy as the convolution of the LDA self- 
consistent Green function G and the screened coulomb interaction W within 
the random-phase approximation. The QP eigenvalues arc often obtained using 
first-order perturbation theory starting from LDA eigenvalues and eigenvectors 
[4,25]. Although there are early calculations starting from Hartree-Fock [26] or 
tight-binding [27] methods. Nevertheless, the best results are based on a LDA 
starting point [4,25,28-30]. 

Thus the GW predicted optical excitations energies of semiconductors are 
within 0.1 eV form the experimental results and the surprizing fact is that the 
QP wave functions are almost identical to these produced within the LDA [4] 
(the wave function overlap is more than 99%). For a general review of GW 
calculations see the review by Araysetianwaii and Guimarsson [29] or by Aulbur, 
Jonsson and Wilkins [30]. 

It is clear that the quasiparticle Schroedinger equation resembles to the 
Kohn-Sham equation. Both equations describe a fictitious electron moving in 
a effective potential. The difference is that the self-energy is nonlocal and energy 
dependent whereas the LDA potential is local and averaged over time. This 
resemblance can be further pushed by noticing that the DFT can be used to ob- 
tain excitation energies. For example, the ionization energy, I, and the electron 
affinity, A, are difference between ground state energies: 



I = E{N - 1) - E{N), 



and 



A = E{N) - E{N + 1) 
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where N is the number of electrons of the system. And since the DFT gives 
the correct ground state energies it should produce, in principle, the correct 
ionization and electron affinity energies. For metals, the addition or removal of 
an electron from the system costs the same energy, and hence the ionization 
energy is equal to the electron affinity. For insulators, the energy gap makes all 
the difference and hence breaks this symmetry. Thus the energy band gap is 
given by: 

Eg = I - A = E{N + 1) + E{N - 1) - 2E{N) 

In practice, however, the calculation is often obtained within the LDA and the 
energy band gap is calculated as the difference between the lowest conduction 
band and the highest valence band. It was shown by Sham and Schliiter [31] 
and Perdew and Levy [32] that the calculated energy gap differ from the true 
band gap by an amount A even when the DFT is used without the LDA. The 
A value could range from 50% in the case of silicon to 100% in the case of 
germanium. For most of the semiconductors, the GW calculations show that 
the LDA eigenvalues differ form the GW quasiparticle energy by a constant A 
which is almost independent of the k-point. This finding is important and shows 
that the LDA eigenvalues have some meaning and could be used to calculate 
excited states. So as stated in the introduction, the Kohn-Sham equations could 
be viewed as deriving form a simplified quasi-particle (QP) theory where the self- 
energy is made local and time averaged, i.e., E(r, r', t) w I4c(r)d(r — r')5(t). This 
approximation is certainly good for metals where we have a good data base for 
excited state calculated within the LDA [11,12,9,14-19] and where the agreement 
with experiment is good. For semiconductors, this approximation is not bad 
either, provided we know the value of the discontinuity of the exchange and 
correlation. Usually, this value is provided by GW calculations or by experiment. 



4 The Full-Potential LMTO Basis Set 

In this section we describe the LMTO basis-set used to calculate the excited sta- 
tes of solids. We discuss the basis used for an all electron calculation where the 
potential is not supposed to be spherically symmetric nor of muffin-tin type. The 
use of a general potential makes the study of open structures possible without 
having to resort to the so-called “empty-sphere” approximation. To define the 
basis-set, we divide the space into non overlapping spheres called “muffin-tin” 
spheres and a region between these spheres which we call interstitial region. In- 
side the muffin-tin spheres the Schroedinger equation is solved at a fixed energy 
for each angular momentum I and variational parameter k (which is defined 
later). The linearization amounts to the use of a linear combination of the so- 
lution (j)i{e,r) of the Schroedinger equation for a fixed energy and its energy 
derivative (j)£{e,r) inside the muffin-tin spheres. These linear combinations mat- 
che continuously and differentiably to an envelop function (spherical function) 
in the interstitial region. The Bloch wave function in the interstitial region is 
given by a linear combination of these Hankel functions centered at each site: 
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V>i(k,r) = (/ti,r-Tj-R) (8) 

R 

where i stands for the number of the basis function quantum numbers (these 
numbers are {t, L, r, {e^t}}), where t is the site number, L = {£, m) groups the 
two angular quantum numbers, and is the linearization energy for a particular 
atom type t and angular momentum number £. The envelop functions are defined 
as ICL{K,r) = ICi{K,r)yLir). 



yemir) 






(9) 


ICi(K,r) 


\ ne{Kr) - ije{Kr), 


if > 0 

, if < 0, (k = i K ) 


(10) 


Je{n,r) 






(11) 



Here ri£ is the Neumann function and ji Bessel function for the angular 
momentum £, and are the spherical harmonics. 

To get the differentiability of the wave-function at the boundary of the muffin- 
tin spheres, we write the envelope function inside the muffin-tin spheres. The 
envelope function for a muffin-tin sphere r' is given by: 



= ^ 3^L'(fr') rr)5{T, t')5{L, L') 

L' 

+ Je'{R,rr)BL',L{r’-T, K,k)j 



(12) 

(13) 



To produce smooth basis functions we require that the basis function is diffe- 
rentiable at the boundary of each muffin-tin sphere, i.e., that a linear combination 
of (p and (j) matches continuously and differentiably K, and J at the boundary of 
the parent sphere and other spheres, respectively. Using these matching conditi- 
ons at the muffin-tin spheres, the Bloch wave function inside a muffin-tin sphere 
T of the unit cell at the origin is given by [7]: 



V’*(k,r) 



yL{'DrVr)Ui{eiu, rr)£2{£t, eeuRi)BL,L,{T-Ti,Ki, k) (14) 

L 



where 



Ue{e,r) 



f Me,r)\ 
\<j)e{e,r) ) 



(15) 
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m , -) - 5 ; ( ) (WU.,)^ /»' - r,) ( 16 ) 

( 11 ) 

To add the spin dependence to the basis-set, the Bloch wave function is 
multiplied by the eigenvector of the Pauli spin operator 



ipaiK^) = iA(k,r)?7„ 



such that: 



n ■ cr?7±i = (±l)r/±i 

where r] is the quantization axis chosen in advance. 



5 Dielectric function 



5.1 Dynamical Dielectric Function 

Here we give a review of the determination of the dielectric response of a semi- 
conductor due to the application of an electric field. We expend the description 
of our published work [10] by giving more details concerning the calculation of 
the momentum matrix elements. 

An electromagnetic field of frequency lu, and a wave vector q + G interacting 
with atoms in a crystal produces a response of frequency uj and a wave vector 
q + G' (G and G' being reciprocal lattice vectors). The microscopic field of wave 
vector q + G' is produced by the umklapp processes as a result of the applied 
field Fo(q + G,w) 



Eo{q + G,u;) = ^ ec,G' (q, + G', a;) (18) 

G' 

where F(q + G, w) is the total held producing the non-diagonal elements in the 
microscopic dielectric function £G,G'(q, The microscopic dielectric function 
in the random phase approximation is given by [33]: 

^G-G'(q>^) = - r2]q + G|lq + G'| 

^ /ragk+q ~ fn,k 

k n n' -^n,k + iS 

{n', k + k)(n, k|e-*(‘i+°')‘'|n', k + q) 
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Here n and n' are the band indexes, /„^k is the zero temperature Fermi 
distribution, and 12 is the cell volume. The energies and the the crystal 
wave function |n,k) are produced for each band index n and for each wave 
vector k in the Brillouin zone. 

The macroscopic dielectric function in the infinite wave length limit is given 
by the inversion of the microscopic dielectric function: 



e{uj) = lim ^ 

q^O e 



G,G' 



1 

(q>w)]o.o 



= «o,o(w) - lim Y] eo,G(q,w)Te G,(q,w)cG',o(q,w) 

q— >0 ^ ^ ’ 

G,G'/0 



(20) 



where Tq q, is the inverse matrix of Tg_g' containing the elements cg,g' with 
G and G' ^ 0. The first term of this equation is the interband contribution 
to the macroscopic dielectric function and the second term represent the local- 
field correction to e. The most recent ab-initio pseudopotential calculation found 
that the local-field effect reduces the static dielectric function by at most 5% [6] . 
Previous calculations with the same method have also found a decrease of e^o 
by about the same percentage [4,34]. For insulators the dipole approximation of 
the imaginary part of the first term of equation (21) is given by [35]: 



^ r 

£2(w) = ^ / dk|(n,k]v|n',k)p/„,k(l - /n',k)< 5 (ek,n',« - fuv) , (21) 

n,n' 

Here v is the velocity operator, and in the LDA v = p/m (p being the momen- 
tum operator), and where ek,n,n' = A’ngk — i'n.k- The matrix elements (nk|p|n'k) 
are calculated for each projection pj = ^dj, j = x or y and z, with the wave 
function jnk > expressed in terms of the full-potential LMTO crystal wave fun- 
ction described by equations (14) and (8). The k-space integration is performed 
using the tetrahedron method [36] with a large number of irreducible k points 
the Brillouin zone. The irreducible k-points are obtained from a shifted k-space 
grid from the high symmetry planes and F point by a half step in each of the 
kx, ky, and kz directions. This scheme produces highly accurate integration in 
the Brillouin zone by avoiding high symmetry points. 



5.2 Momentum Matrix Elements 

To calculate these matrix elements we first dehned a tensor operator of order one 
out of the momentum operator Vq = Vz = Jh and V±i = T;^(^ 

The muffin-tin part of the momentum matrix elements is calculated using the 
commutator [V^,a;^] = 2V/z so that: 

j dY(pri'{r) y4>re{r)Yim{^'^) = 

ft -n) 



( 22 ) 




176 



M. Alouani and J. M. Wills 



where g, are the usual Gaunt coefficients, and 5'^ is the radius of the 
muffin-tin sphere of atom t. In the interstitial region the plane-wave representa- 
tion of the wave function (see equation 8) makes the calculation straightforward, 
but a special care has to be taken for the removal of the extra contribution in 
the muffin-tin spheres. However, we find it much easier and faster to transform 
the interstitial matrix elements as an integral over the surface of the muffin- 
tin spheres using the commutation relation of the momentum operator and the 
Hamiltonian in the interstitial region. The calculation of the interstitial momen- 
tum matrix elements is then similar to the calculation of the interstitial overlap 
matrix elements. The k = 0 case has been already derived by Chen using the 
Korringa, Kohn and Rostoker Greens-function method [37]. We have tested that 
both the plane-wave summation and the surface integration provide the same 
results. 



-V^ pV' = 

A Hankel function can be integrated over a volume by knowing its integral over 
the bounding surface: 



j (frS/ {ipiVpiip2 - (VV’l') Pitp2) 

= {kI - kI) j (frip^piip2 



(23) 



The surface of the interstitial consists of the exterior of the muffin-tin spheres 
and the unit cell boundary. 

Over the surface of the muffin tins: the surface area is S^dfl and the normal 
to the sphere points inward 



[k\ - kI) j d^r 'ipiPiip2 = (24) 

At a muffin-tin sphere boundary the Bloch wave function is given by: 



tpi (k, r) = ^ (kj, r-Ti-R) 

Sr 



Sr 



(25) 



'y ) 3^rm(^)Kr(K:2, (t t^, k) 






where B£^y,m.^(T-Ti, Kj, k) 



Let W denote the Wronskian W (/, g) = fg' — f'g 



S{T,Ti)6{i,£i}S{m,m^) 



and K 
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We define then 



52 Wo = S^WiKj{K),Ke{K)) = (_°i J) 

and 



WreKl, K 2 



W{Kj{n)),Ke{K)-Wo 

2 2 



PV'ilr 



K, 



M im 



i~ lm~fi 



1 0 
0 K? 



<0 

0 1 



1 , m— /Lt; f, m; 1 , /x) 



£+ 1 , m— /x; £, m; 1 , 






then 



(26) 



where 



( '<PfP'^i)T = 

EE^.E 



■- 






X Q* 0 (^- 1 , rn-ii-, £, m; 1 , /x) 

X (^0 /t^) m-^; £, m; 1, /x) 

B£ 7 TT,,£,^rn,. k)) +Zi(/,z,/x,) 



(27) 



(«;/ - K^)‘^{f,^,Ki) = ^e^(Ti) 

+5;^+i™,_^,^^^^(ri-r/,fi;/,k)g(fi + l, £i, rm; 1, ii)k^ 

-Bi^-inH-ij„ifmf{n-Tf,Kf,'k)Q[ii-l, mi~ £i, m*; 1, /x)k^ 
M *- 

— 1t 71/ — ("^Z i ^)^ ( ^/ ) ^Z’ ^Z 1, 



(28) 



(29) 
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5.3 Velocity Operator and Sum Rules 

Equation (21) can not be used directly to determine the optical properties of 
semiconductors, when the GW approximation or the scissors operator is used 
to determine the electronic structure. The velocity operator should be obtained 
from the effective momentum operator which is calculated using the self- 
energy operator, i7(r,p), of the system [38]: 

V = p®*^/m = p/m + 9A'(r, p)/9p (30) 



GW calculations show that the quasiparticle wave function is almost equals to 
the LDA wave function [4,5]. Based on this assumption, it can be easily shown 
[38] that in the case of the scissors operator, where all the empty states are shifted 
rigidly by a constant energy A, the imaginary part of the dielectric function is 
a simple energy shift of the LDA dielectric function towards the high energies 
by an amount A, i.e., e^^(w) = ~ '^/^)- The real part of the dielectric 

function is then obtained from the shifted £2 using Kramers-Kronig relations. 
The expression of is given by: 



.QP - 



2e^ 

3lj^7 



E 



dk/„,k(l “ fn',k) 



(n,k|p|n',k)l^ 



(Ck,n 



A)£ 



(31) 



is very similar to except that one of the interband gap ek,n',n is sub- 
stituted by the QP interband gap ek,n',n + A. 

To test for the accuracy of the calculation within the LDA the f-sum rule: 



2 

3mn„ 



fn, k(l fn',k) 

k n,n' 



(n,k|p|n/k)p 

^k,n',n 



1, 



(32) 



where n„ is the number of valence bands, should be always checked to ensure 
the accuracy of the calculations. 

O P 

It is easily seen that the dielectric function £2 calculated using the scissors- 
operator shift does not satisfy the sum rule {top is the free-electron plasmon 
frequency) : 




uie2{ui)duj 




(33) 



because (i) £ 2 ^^ satisfies this rule, and (ii) £^^ is obtained by a simple shift of 
£2 by the scissors-operator A towards higher energies. The non simultaneous 
satisfaction of both the f-sum rule and the integral sum rule within the scissors 
approximation shows the limitation of this approximation. While the scissors 
operator approximation describes nicely the low lying excited states, which is 
seen in the good determination of the static dielectric function and the low energy 
structures, i.e. Ei and E 2 , in the imaginary part of the dielectric function, it 
seems to fail for the description of the higher excited states. This is not surprising 
because the higher excited states which are free electrons like are most probably 
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Ej of GaAs 




Energy (eV) 



Fig. 1. Calculated Imaginary part of the dielectric function of GaAs at the experimen- 
tal equilibrium volume both within LDA and shifted by 1.3 eV, compared with the 
experimental results of Ref. [39]. The experimental Ei is only slightly underestimated 
while E 2 is overestimated. Notice that the shifted dielectric function by 1.3 eV, which 
produces the correct band gap, overestimates the peak positions by about 0.3 eV. 
Excitonic effect should shift these peaks to lower values in agreement with experiment. 



well described within LDA and need no scissors-operator shift. This is supported 
by the fact that the the energy- loss function, within the LDA has it 

maximum roughly at the free electron plasmon frequency whereas within the 
scissors approximation its maximum is shifted to higher energies. For our purpose 
the scissors-operator shift remains a good approximation for the description of 
the low-lying excited states of semiconductors and their optical properties. 



6 Applications 

6.1 Optical Properties 

We have used our FP-LMTO method and the formalism outlined above to cal- 
culate the optical properties of materials [9-12]. In general our results arc often 
in good agreement with the experimental results. For semiconductors, however, 
good agreement with experiment is only achieved when the so called scissors- 
operator shift is used. Figure 1 presents our relativistic calculation of the ima- 
ginary part of the dielectric function of GaAs compared to the experimental 
results of Ref. [39]. The LDA relativistic results underestimates the band gap 
by about 1.3 eV. When the imaginary part of the dielectric function is shifted 
to higher energies by 1.3 eV the results the E\ and E 2 peaks are overestimated 
in our calculation. One needs to shift the spectrum by less than the band gap 
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as done in Ref. [10] to produce good agreement with experiment. It seems then 
that the optical band gap is less than the band energy gap (1.5 eV). The opti- 
cal band gap is produced by interband transitions to the low lying conduction 
states. Excitonic effects are therefore important and are responsible for the re- 
duction of the energy gap of semiconductors. It is interesting to notice though 
that the static dielectric function are in good agreement with experiment for 
GaAs, Si, and Ge when the shift correspond to the energy band gap obtained 
from photoluminescence [6,10]. 

More interesting are the wide band-gap materials where the LDA calculated 
static dielectric function is in good agreement with experiment despite that 
the band gap is still underestimated by LDA. Correcting the band gap using 
the scissors operator makes the static dielectric much small than the measured 
value. As an example of wide gap material, we present in Figure 2 and 3 the 
imaginary part of the dielectric function of GaN for the cubic (3) and wurtzite 
structure (B4). 



Table 1. Calculated static dielectric function too for GaN compared to pseudopotential 
(PP) results and experiment. For the wurtzite structure we have calculated eL for a 
polarization parallel to the xy plan and which is perpendicular. 





zinc-blende 

^OO 


wurtzite 

Coo 


Cqo 


PP 


5.74 


5.48 


5.60 


Present work 


5.96 


5.54 


5.65 


Expt. 




5.35 


5.35T0.2 



Table 1 shows that our LDA dielectric constant calculations arc in agreement 
with available experimental results and the pseudo-potential (PP) results [40] 
including local- field effects (an error about our calculation is reported in Ref. [40]; 
our value for cL is not 4.48 but 5.54 and the PP value should then be 4.48). 
It is interesting to notice that static dielectric is in good agreement for the for 
all the nitrides [40] while the band gap is underestimated. The scissors-operator 
shift fails to explain the static dielectric function of large gap semiconductor. 
Recently, both local-field effects and electron-hole interaction were included on 
an ab-initio computation of the dielectric function of few semiconductors [41,42] 
by extending the semi-empirical Hanke and coworkers approach [26,43] which is 
based on the solution of the Bcthc-Salpetcr equation [43]. The cxcitonic effects 
seem to improve significantly the agreement between theory and experiment. 
However for large band-gap semiconductors, such as diamond, the inclusion of 
the excitonic effects seem to underestimate the optical band gap by as much 
as 1 eV [42]. It is not clear from these calculations whether the static dielectric 
function for wide-band gap semiconductors is improved when excitonic effects are 
included. More theoretical work along these lines is needed to fully understand 
the dielectric function of wide-gap semiconductors. 
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Imag. part of the dielectric function of GaN 



zinc-blende and wurtzite structures 




Real part of the dielectric function of GaN 

zinc-blende and wurtzite structures 




Fig. 2. Calculated imaginary and real parts of the dielectric function of GaN in its 
cubic and wurtzite forms. The LDA band gap of the cubic phase is 1.8 eV and the 
wurtzite phase is 2.2 eV. 



6.2 Magnetic Circular Magnetic Dichroism 

X-ray absorption spectroscopy (XAS) probes selectively each core orbital of each 
atomic species in a material. Two decades ago the theoretical work of Erskine 
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and Stern show that the x-ray absorption could bo used to determine the x-ray 
magnetic circular dichroism (XMCD) in transition metals when left and right 
circularly polarized x-ray beams are used [44]. More recently these ideas were 
implemented experimentally and XAS was used to determine the local magnetic 
properties of each magnetic atomic orbital in a magnetic compound [45,46]. Thus 
the circular magnetic x-ray dichroism is an important tool for the investigation of 
magnetic materials [45-56] , especially through the use of sum rules for the direct 
determination of the local orbital and spin contributions to the total magnetic 
moment [50]. 

Thole and co-workers show that the circular-magnctic-x-ray dichroism is re- 
lated to the magnetic moment of the photo-excited atom when the core electron 
is excited to the conduction states that are responsible of the magnetic properties 
of the material. On the theoretical side, Ebert and his co-workers [51,52] have de- 
veloped a fully-relativistic local-spin-density-approximation approach that was 
used with success to calculate the XMCD at the K-edge of Fe, the Ls-edge of 
Gadolinium, and Fe and Co multilayers. Wu et al used slab linear augmented 
plane wave method to study the L 2,3 XMCD of Fe [56]. Brouder and co-workers 
uses Multiple-scattering theory to solve the Schrodinger using spherical poten- 
tials and spin-orbit coupling as a perturbation in the hnal state [53]. Recently 
Ankudinov and Rehr used a method based on a non-relativistic treatment of pro- 
pagation based on high order multiple scattering theory and spinor-relativistic 
Dirac-Fock treatment of the dipole matrix elements to calculate the Fe K edge 
and Gd Lg edge XMCD [54]. 

The calculation of the x-ray absorption for left and right circularly polari- 
zed x-ray beams is implemented within the local-density approximation (LDA) 
by means of all-electron full-relativistic and spin-polarized full-potential linear 
muffin-tin orbital method (LMTO). The core electrons are spin-polarized and 
their electronic states are obtained by solving the full-Dirac equation, whereas 
for the valence electrons the spin-orbit coupling is added perturbatively to the 
the semi-relativistic Hamiltonian. The total Hamiltonian is then solved self- 
consistently. To calculate the polarization dependent cross-section we consider 
the case where the internal held polarizes the spins along the magnetization easy 
axis. With respect to this axis we dehned the left- and right-circular polarization, 
which correspond to the photon helicity {+h) {—h) respectively and the follo- 
wing dipole interaction: e±p = -^C^x ± iVy). The absorption cross-section 
for left (+) and right (— ) circular polarized x-ray calculated at the relativistic 

(€ ± i) core level and in the dipole approximation is given by: 

^^{j±mjJe±p\n'k){nk\pe±\j±mjJ6{uj - Enk + Ej^) (34) 

using LDA in conjunction with the relativistic full-potential LMTO technique. 

Figure 3. represent the K-edge x-ray absorption of Fe, for left and right 
circularly polarized light, compared to the experimental results. The agreement 
at low energy with experiment is good and start degrading at higher energies 
above the mean absorption peak. It is of interest to point out that the magnetic 
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K-edge x-ray absorption of Fe 




Fig. 3. Calculated x-ray absorption at the K-edge of Fe for left and right circularly 
polarized light compared to the experimental spectrum. The difference between the two 
spectra (barely visible on the graph) represents the x-ray magnetic circular dichroism. 



x-ray dichroism at the K-edge which is due to the spin polarization and the spin- 
orbit in the final state is very small in the case of Fe. The difference between the 
right and left circularly polarization of the light is not even visible on the graph. 
However, the x-ray magnetic circular dichroism can be measured and Figure 4 
shows a good agreement of the calculated dicliroic signal with the experimental 
results of Shiitz [46] . 

At the L2,3 edge of 3d transition metals the x-ray magnetic dichroism is 
much important because it is meanly due to the presence of the strong spin- 
orbit coupling in the initial 2p states (in the case of Fe the spin-orbit splitting 
between the 2^3/2 and 2pi/2 is about 13 eV). In Figure 5 we show the calculated 
x-ray absorption and XMCD at the Co in PtCo ordered alloy [21]. 

To compare the results with experiment we have to take into account the ef- 
fect of the core hole and the experimental resolution. This is done by convoluting 
the calculated spectra by a Lorentzian of widths of 0.9 eV and 1.4 eV for the L2 
and L3 edges, respectively, in addition a Gaussian broadening of 0.4 eV is added 
to take into account the experimental resolution. The calculation of the x-ray 
magnetic circular dichroic signal ignoring the electron-hole recombination effect 
provides a semi-quantitative agreement with the experimental spectra. Hence, 
we believe that the core hole effect represented here by a Lorentzian broadening 
plays a significant role in determining the correct L3/L2 branching ratio for 3d 
transition metals. The underestimation of the 7^2,3 branching ratio remains a 
challenge for theorists and further theoretical development along the line pro- 
posed by Schwitalla and Ebert [57] is needed to bring the theory at the level of 
the experiment. 



184 



M. Alouani and J. M. Wills 



K-edgeXMCD ofFe 




Fig. 4. Calculated x-ray magnetic x-ray dichroism at the K-edge f Fe compared to the 
experimental spectrum of Shiitz[46]. 




Energy (eV) 

Fig. 5. Calculated x-ray absorption and magnetic x-ray dichroism at the L 2 ,s-edge of 
Fe compared to the experimental spectrum of Grange et aZ. [21]. 



For the 4cZ-transition metals, the core hole is deeper, and the agreement with 
experiment of the XMCD is satisfactory. Figure 6 shows the calculated XMCD 
at the site of Pt of the CoPt ordered alloy. 

In contrast to what is obtained for Co, the results for the Pt site show a much 
better agreement with experiment, due to the fact that the core hole effect is 



Excited States 



185 




Fig. 6. Calculated x-ray magnetic x-ray dichroism at the L2,s-edge of Pt compared to 
the experimental spectrum of Grange et al. [21]. 



less intense (core hole much deeper than that of Co). For the Pt atom we used 
both a Lorcntzian (1 cV) and a Gaussian (1 cV) to represent the core hole effect 
and a Gaussian of 1 eV width for the experimental resolution. The experimental 
and theoretical L 2 and L3 edges are separated by a spin-orbit splitting of the 2p 
core states of 1709 and 1727 eV respectively. The width of both L 2 and L3 edges 
is comparable to experiment, but the calculated L 2 edge is much larger. This 
prodTices a calculated integrated branching ratio of 1.49 which is much smaller 
than the experimental ratio of 2.66. Here again the theory is underestimating 
the branching ratio. 



7 Conclusion 

We have reviewed the FP-LMTO method and the implementation of the optical 
properties and x-ray magnetic dichroism within the local density approximation. 
We have showed that the momentum matrix elements can be evaluated as a 
muffin-tin contribution and a surface term. The method has been successfully 
used to compute the optical properties of metals [11,12], semiconductors [9,10] 
and magneto-optical properties [11] of transition metals alloys, as well as x-ray 
magnetic circular dichroism [20-22] with high precision. 

For small-gap semiconductors a scissors-operator shift should be used to re- 
produce the static and dynamic dielectric function [10]. Excitonic effects seem to 
be important in reproducing the correct optical energy gap [41,42]. For wide-gap 
semiconductors the local-density approximation (LDA) static dielectric function 
is in good agreement with experiment and no scissors-operator shift is required 
despite the underestimation of the band gap by LDA [40]. 

For the computation of the x-ray magnetic circular dichroism the agreement 
with experiment is rather good [20-22,51-55]. However, the so called branching 
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ratio is underestimated by the theory. More theoretical work where the electron 
core-hole interaction is taken into account is needed to bring the theory at the 
quality level of experiment [57]. 

Part of this work was done while one of us (M.A) was at Ohio State University 
and were supported by NSF, grant number DMR-9520319. Supercomputer time 
was granted by CXUSC (project geml917) on the IBM SP2 and by the Universite 
Louis Pasteur de Strasbourg on the SGI 02000 supercomputer. 
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Fully Relativistic Band Structure Calculations 
for Magnetic Solids — Formalism and Application 
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Abstract. Relativistic effects, in particular the spin-orbit coupling, give rise for ma- 
gnetic systems to a great number of interesting and technologically important pheno- 
mena. The formal and technical aspects of corresponding fully relativistic theoretical 
investigations are reviewed. The properties of the underlying Dirac equation, set up 
within the framework of density functional theory (DFT) are discussed together with 
the Breit-interaction and Brooks’ orbital polarization mechanism. As an example for 
a corresponding band structure method, the Korringa-Kohn-Rostoker (KKR) Green’s 
function method is adopted. In particular, some technical aspects specific to this tech- 
nique are discussed. The numerous applications that will be presented are primarily 
meant to demonstrate the many different facets of relativistic - this means in general 
- of spin-orbit induced effects in magnetic solids. In addition, these also demonstrate 
the tremendous flexibility of band structure schemes based on the Green’s function 
formalism. 



1 Introduction 

Relativistic influences on the electronic properties of solids are known for quite 
a long time. One of the most prominent examples for these is the position of the 
optical absorption edge of Au. Compared to that of Ag this is higher in energy 
giving rise to the characteristic yellow colour of Au [1]. Another example is the 
relativistic contraction of s-type electronic shells, that has important consequen- 
ces even in chemistry [2]. In the case of Au this leads to the existence of Au^-ions 
in the compounds CsAu and RbAu, while corresponding Ag-compounds are not 
found. One of the early hints for the importance of spin-orbit coupling for the 
electronic band structure of solids stems from X-ray absorption experiments. Na- 
mely, it was observed by Cauchois and Manescu [3] that for the La-absorption 
spectrum of Pt there occurs a white line at the absorption edge while none was 
found for the L 2 -edge. Mott [4] ascribed this finding to the spin-orbit coupling, 
that should cause the d-states of Pt above the Fermi-energy to have predo- 
minantly d 5 / 2 -character. As a consequence of this and because of the dipole 
selection rules Aj = 0,±1 on expects strong absorption for the Ls-edge but 
not for the L 2 -edge. Another example for the influence of the spin-orbit cou- 
pling is supplied by de Haas-van Alphen-experiments on W. Here it was found 
that the electron- and hole-surfaces do not touch in the F -H-direction, as it was 
predicted from non-relativistic band structure calculations [5]. The mentioned 
experiments and many others clearly demonstrated the influence of relativistic 
effects and that way the need to account for them within a corresponding band 
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structure calculation. Nevertheless, it will in general depend on the specihe ex- 
periment one wants to describe to what extent this has to be done. For example, 
in the case of the quadrupolar and magnetic hyperfine interaction, that takes 
place in the vicinity of the nucleus where relativistic influences on the electrons 
are most pronounced, these will show up even for relatively light elements [6]. 

For many cases, it is well justified to deal with relativistic effects by introdu- 
cing corresponding corrections to the Schrodinger equation. One of the standard 
techniques to derive these is to apply a Foldy-Wouthuysen-transformation to re- 
move the coupling between the large and small component of the Dirac equation 
[7]. Because of technical problems connected with this approach - in particu- 
lar its convergence behavior - several alternative schemes have been suggested 
that in general also lead to a two-component formalism and that in some cases 
are also somewhat problematic. Among these for example the elimination me- 
thods [8-14], the Douglas-Kroll-Hefi-transf'ormation [15], or the ZORA-scheme 
[16] aim to derive an effective Hamiltonian that contains - compared to a non- 
relativistic Schrodinger-Hamiltonian - relativistic correction terms. In contrast 
to this, Gesztesy et al. worked out an expansion scheme for the corresponding 
Green’s function [17,18]. In spite of the obvious differences between the various 
schemes mentioned, they nevertheless all lead to the mass-velocity, the Darwin 
and the spin-orbit coupling terms [7] as the most important corrections, that 
are all proportional to (1/c^). Of course, the explicit expressions supplied by 
the various approaches differ to some extent. Nevertheless, one may unambi- 
guously call the first two correction terms scalar-relativistic because of their 
transformation properties [19]. Most important, these imply that the scalar- 
relativistic corrections leave spin as a good quantum number. Accordingly, it 
is very simple to include them in a band structure programme that is set up 
in a non-relativistic way - even for spin-polarized systems. However, one has to 
note that modifying the basic electronic Hamiltonian, one may have to adopt 
the expressions for operators representing physical observables [20,21], The most 
prominent example is that for the Fermi-contact part of the magnetic hyperfine 
interaction. Inconsistent scalar-relativistic calculations indicated for example for 
3d-transition metals relativistic corrections in the order of 40 % [22,23] while 
these amount only to about 10 % [24,25]. Another important example in this 
context is the electron-photon interaction operator. While this implies the .sel- 
ection rule Arris = 0 for the non-relativistic case, the corresponding relativistic 
corrections involve the spin-orbit coupling operator allowing for that reason for 
spin-flip optical transitions [21,26-28]. 

Compared to the scalar-relativistic corrections, accounting for the spin-orbit 
coupling within a band structure calculation is much more demanding because 
the corresponding correction operator contains explicitly the spin operator. An 
efficient way to include its effects within a band structure calculation, that is 
based on the variational principle, is to add the matrix of the corresponding 
operator to the Hamiltonian matrix in the secular equation [29,30]. For a pa- 
ramagnetic system this will increase the time to solve the secular equation by 
a factor of 8 because the dimension of the matrices is doubled. This computa- 
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tional effort can be reduced to some extent by applying the second variational 
technique [31]. Because in both cases the basis functions are set up only in a 
non- or scalar-relativistic level, it was often argued that this procedure will lead 
to problems if the spin-orbit coupling is very strong. However, recently it could 
be demonstrated that one can apply it without problems even to compounds 
containing Pb, for which the spin-orbit coupling for the p-states is quite high 
[32]. Of course, the most reliable way to deal with all relativistic effects is to 
start from the Dirac equation. In fact, for more or less any of the standard band 
structure methods corresponding versions have been developed during the last 
30 years (for example APW [33], OPW [34], KKR [35,36], ASW [37]). Dealing 
with paramagnetic solids these methods do not require more computer time than 
accounting for spin-orbit coupling in the variational step, because the dimension 
of the Hamiltonian matrix is just the same. In spite of this, the later approach 
seems to be much more popular because it allows one to discuss the final results 
in familiar terms. 

In contrast to the scalar-relativistic corrections, the spin-orbit coupling has 
many far-reaching consequences for the qualitative aspects of the electronic 
structure. The reason for this is the lowering in symmetry that is caused by 
the coupling of the spin and orbital degrees of freedom, that leads - among 
others - to a removal of energetic degeneracies. Another consequence of spin- 
orbit coupling is the occurrence of physical phenomena, that cannot be described 
within a non- or scalar-relativistic framework. A very well-known example for 
this is the so-called Fano-effect [38,39], that denotes the finding that one ob- 
tains a spin-polarized photo-electron current even for a paramagnetic solid [40] 
if one uses circularly polarized light. Of course, the spin polarization gets just 
reversed if the helicity of the radiation is reversed. For a magnetic solid, ho- 
wever, this symmetry is broken. A direct consequence of this broken symmetry 
are the magneto-optical Kerr-effect [41,42] in the visible regime of light and the 
circular magnetic X-ray dichroism [42,43] at higher photon energies. Of course, 
there are many other phenomena in magnetic solids, that are caused by the com- 
mon occurrence of spin-orbit coupling and spin-polarization, as for example the 
galvano-magnetic phenomena [44], the magneto-crystalline anisotropy [45], or- 
bital contributions to the hyperhne fields and magnetic moments [24] or electric 
field gradients in cubic solids [46]. 

The first band structure calculations aiming to calculate spin-orbit indu- 
ced properties in magnetic solids have been done by Callaway and coworkers 
[29,26,47]. These authors and later on many other authors [48-56] accounted for 
spin-orbit coupling in the variational step with the unperturbed Hamilton ma- 
trix describing a spin-polarized system. The basis functions used in this approach 
depend only on the orbital angular momentum quantum number I but carry no 
information on the spin-orbit coupling. This does not apply to the scheme sug- 
gested by the author [57] for which the role of the exchange splitting and the 
spin-orbit coupling are interchanged compared to the approach of Callaway and 
others. In particular this means that four-component basis functions are used 
that are obtained as solutions to the Dirac equation for a spin-averaged poten- 




194 H. Ebert 



tial, while the spin-dependent part is accounted for in the variational stop. An 
alternative way to include the influence of spin-orbit coupling already in the wave 
functions has been used by McLaren and Victora [58]. These authors adopted 
the formalism suggested by Koellling and Harmon [10] that works with two com- 
ponent wave functions in a (i, m;, m;,)-representation and that leads to coupled 
sets of radial differential equations if the spin-orbit coupling term is included. A 
very similar scheme has been used by Akai [59], who restricted the effect of the 
spin-orbit coupling to within a spin subsystem keeping spin as a good quantum 
number that way. To avoid the numerical effort in solving the coupled set of 
radial differential equations, Ankudinov [60] suggested an approximate scheme 
that is exact for vanishing spin-orbit coupling or vanishing exchange splitting 
and interpolates between these two extreme cases. Because the later three sche- 
mes account for spin-orbit coupling and exchange splitting already in calculating 
the wave functions, they can be used straightforwardly as a starting point for 
multiple scattering theory. 

To deal with all relativistic effects and magnetism - at least for transition 
metals this means first of all spin polarization - on the same level, it was sugge- 
sted already in the 1970s to work on the basis of the appropriate Dirac equation. 
Dealing with exchange and correlation within the framework of density functio- 
nal theory this leads in a rather natural way to current density functional theory 
(CDFT) [61] instead of the conventional spin density functional theory (SDFT) 
[62]. However, because of the many unsolved problems connected with this ge- 
neral scheme a relativistic version of spin density functional theory has been 
suggested [63,64]. Instead of dealing with the resulting Dirac-Hamiltonian for 
spin-polarized systems, Richter and Eschrig [65] suggested to use the correspon- 
ding squared Dirac Hamiltonian and developed a corresponding spin-polarized 
relativistic LCAO-band structure method. The hrst step to start from the spin- 
polarized relativistic Dirac equation itself has been done already 20 years ago 
by Doniach and Sommers, who derived the corresponding coupled radial Dirac 
equation [66]. The problem has later been investigated in more detail by Feder 
et al. [67] and Strange et al. [68]. In particular these authors could present the 
first numerical solution to the coupled radial equations for a single potential 
well. With this crucial step done, it is possible to derive for any band struc- 
ture method its spin-polarized relativistic (SPR) version. This has been done for 
example for the KKR [67,68], the LMTO [57,69], and the ASW [70] methods. 

In the case of the SPR-KKR the fc-space mode based on the variational 
principle [71] as well as the multiple scattering mode leading directly to the 
Green’s function [24,67,72] has been generalized accordingly. In particular the 
later approach (SPR-KKR-GF) was extensively used during the last 10 years. 
Some reasons for this are that one does not require Bloch symmetry for the 
investigated system and that one can link it straightforwardly to the Coherent 
Potential Approximation (CPA) alloy theory [73]. A major drawback of the 
SPR-KKR is its numerical effort required for complex systems. This problem 
could be overcome for many situations by the development of a TB-version 
[74,75] and the use of real space cluster techniques [76-78]. Nearly all SPR-KKR- 
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GF calculations performed so far were based on a muffin-tin or atomic sphere 
approximation (ASA) construction of the potential. A full-potential version of 
it has been worked out by various authors [79-81] and could be implemented 
recently in a self-consistent way [82,83]. Although in practice sometimes tedious 
because of the complex wave functions, it is possible to investigate more or less 
any property of magnetic solids using the SPR-KKR-GF formalism. In particular 
it is now one of the standard starting points to deal with magnetic dichroism in 
many kinds of electron spectroscopy as for example in X-ray absorption [84,85], 
in X-ray fluorescence [86], core- level XPS [87,88], valence band XPS [89], angular 
resolved valence band UPS [90], magnetic scattering [91], and the Faraday effect 
in the X-ray regime [92]. 

Most of the benefits supplied by the SPR-KKR-GF method can also be ob- 
tained using a corresponding version of the TB-LMTO-method [93-95]. Thus, it 
seems for many purposes and situations just a matter of taste which one of the 
band structure schemes is used. In the following the SPR-KKR-GF is described 
in some detail and a number of applications is used to demonstrate its great 
flexibility. 



2 Formalism 



2.1 Relativistic Density Functional Theory 



When dealing with the electronic structure of magnetic solids one usually neglects 
the influence of orbital magnetism on it. Accordingly, corresponding band struc- 
ture calculations arc in general done on the basis of spin density functional theory 
(SDFT) as it has been derived among others by von Barth and Hedin [62] in 
a non-relativistic way. This framework seems still to be acceptable when rela- 
tivistic effects are included by introducing corresponding corrections terms to 
the Schrodinger equation. If fully relativistic calculations are performed instead, 
in principle a corresponding basis shoTild be adopted to deal with many-body 
effects. The first step in this direction has been done by Rajagopal and Calla- 
way [61], who derived the SDFT starting from a relativistic level. These authors 
demonstrated in particular that quantum electrodynamics supplies the proper 
framework for a relativistically consistent density functional theory and derived 
the corresponding relativistic Kohn-Sham-Dirac equations [96,97]: 



ca ■ I -V + -Aeff(r) -I- /3mc^ + Ueff (^) 

1 c ' 



iFj(r) = 



( 1 ) 



with 



Ves{r) 



—e 



A°Jr) 



Aext(r’) 



1 

c 

1 

c 




\r — r'\ 
Jjr') 
|r — r'l 



9^xc[J^] ' 

9J0(r) 

dE^,[Jf^Y 

dJ{r) 



(2) 






(3) 




196 H. Ebert 



Here the !^i(r) are four-component wave functions (see below) with correspon- 
ding single particle energies e^. The matrices and (3 are the standard 4x4- 
Dirac matrices [7]. The effective scalar and vector potentials, Vea and Agfr, res- 
pectively, contain as a first term the corresponding external contributions. The 
second terms in Eqs. (2) and (3) are the familiar Hartree potential and the vec- 
tor potential due to the Breit-interaction, respectively. Finally, the third terms 
are caused by exchange and correlation with the corresponding exchange and 
correlation energy E^c[J^] being a functional of the electronic four-current J^. 
This central quantity that determines all properties of the system is given by: 



J° = 

i 

i 



( 4 ) 

( 5 ) 



where J°/c is identical with the familiar electronic charge density p, while the 
other components give the spatial components of the electronic current den- 
sity j. 

Thus, in contrast to non-relativistic SDFT, where the central quantities are 
the spin densities equivalently the particle density n and spin magne- 

tization density m, the relativistic formalism leads in a natural way to a current 
density functional theory (CDFT). Because of the problems connected with this 
very general scheme an approximate relativistic version of SDFT has been wor- 
ked out by several authors [61,63,98-100]. The first step in this direction is the 
Gordon decomposition of the spatial current density into its orbital and spin 
parts [61,97,101]: 



Jorb 



— <Ft/3 
2m 



- V --V 

i i 



2eA 



E H V X PaE 

2m 



(6) 



where cr is the vector of 4 x 4-Pauli matrices [7]. The coupling of the spin part 
Jspin (the second term in Fq. (6)) to the vector potential may alternatively 
be described by introducing the corresponding spin magnetization density 



i 



( 7 ) 



This leads to the coupling term 



m ■ J3eff , (8) 

with i?efE the effective magnetic held corresponding to A^^g. Thus, ignoring the 
orbital current density contribution jorb one arrives at a Kohn-Shani-Dirac equa- 
tion completely analogous to the non-relativistic SDFT Schrodingcr equation 
[97,101]: 

^ca • V + (3mc^ + Deff(r) -I- f3cr ■ Bes{r) 






(9) 
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with 






-Bext(T’) + 



dE^c[n,m] 

dm{r) 



(10) 



This approach has been suggested among others by MacDonald and Vo- 
sko [63,98], who justified this simplification by introducing a fictitious magnetic 
field that couples only to the spin degree of freedom as reflected by Eq. (9). 
This formal justification has been criticized by Xu et al. [102] because descri- 
bing a relativistic electronic system in terms of the particle density n and spin 
inagnctization density m alone the magnetic intcraetion part eonnccted with 
the electronic current density is not Lorentz invariant. This problem could be 
circumvented by Rajagopal and coworkers [100,102,103] by considering first the 
problem for the rest frame of an electron - for which jorb vanishes - giving a 
consistent justification for the use of relativistic SDFT. 

The orbital current density contribution to E^c - ignored within SDFT - 
has first been considered by Vignale and Rasolt on a non-relativistic level [104- 
107]. As one of the central quantities these authors introduce the paramagnetic 
orbital current density j'orb.p (see below). Because of the restrictions caused by 
the demand for gauge invariance this is replaced then by the so-called vortieity: 

iz = V X . (11) 

n[r) 

This step in particular allows to derive a local version of non-relativistic CDFT. 
A corresponding explicit expression for the corresponding iJxc has been given 
for the first time by Vignale and Rasolt [105]; 

E^,[n,v]=E^,[n,0\ + j dx(^-^^ 2 ^^ ^ ^ 

where Ts = and 

^ = 1 + 0.02764rs liir^ + O.OMOTr^ + ©(r^hirA (13) 



is the ratio of the diamagnetic susceptibility for the interacting and non-interacting 
electron gas. Later, more sophisticated expressions for E^c have been given [108]. 

The Vignale-Rasolt CDFT-formalism can be obtained as the weakly relati- 
vistic limit of the fully relativistic SDFT-Dirac equation (1). This property has 
been exploited to set up a computational scheme that works in the framework 
of non-relativistic CDFT and accounts for the spin-orbit coupling at the same 
time [109]. This hybrid scheme deals with the kinematic part of the problem in a 
fully relativistic way whereas the exchange-correlation potential terms are trea- 
ted consistently to first order in 1/c. In particular, the corresponding modified 
Dirac equation 



h 

—cot 



■ V + /Imc^ + Veff(j') + /3cr • Beff(r) + ^l3Hop,aP<j 

a 



'I'lir) 






(14) 
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incorporates a term 

^op,. = -^[A,c.(r),V]^ , (15) 

that explicitly represents the coupling of the orbital current and the exchange- 
correlation vector potential Axc,a- Since in the CDFT-formalism of Vignale and 
Rasolt Axc,ct is defined in a spin-dependent way, the spin-projection operator 
Pa = ^^2 appears in addition to Hop.a in E<1- (14)- 

Within the above approximate relativistic CDFT scheme the Breit-interac- 
tion has been ignored. This radiative correction accounts for the retardation of 
the Coulomb-interaction and exchange of transversal photons. A more complete 
version than that included in Eq. (1) is given by the Hamiltonian [2,110]: 



— 6 '^ 

^Breit = Tryr“l • “2 + OLI OL2 Oil RoL2 R with R = V2 



ri , 
(16) 



where the hrst part is the magnetic Gaunt part and the second one is the retarda- 
tion term. While inclusion of the Breit-interaction within quantum-chemical cal- 
culations for atoms and molecules is nearly standard [2], so far only one model 
[111] and one fully relativistic [112] calculation have been done in the case of 
solids. This is quite astonishing, because the Breit-interaction gives rise to the 
so-called shape anisotropy, that contributes in general to the magneto-crystalline 
anisotropy energy to the same order of magnitude as the spin-orbit coupling (see 
below) . 

An alternative to the CDFT approach is the heuristic suggestion by Brooks 
and coworkers [113-115] to use a fc-space method and to add a so-called orbital 
polarization (OP) term to the Hamilton matrix. This additional term has been 
borrowed from atomic theory and is meant to account for Hund’s second rule, i.e. 
to maximize the orbital angular momentum. During the last years, this approach 
has been applied with remarkable success to d- as well as f-electron systems 
and has been refined by various authors [116,117]. As it could be shown [118], 
Brooks’ OP-term can be formulated in a way that can be incorporated into 
the Dirac equation allowing that way for a corresponding extension of band 
structure methods based on multiple scattering theory [118]. For a d-electron 
system, i.e. for the case that orbital magnetism is primarily due to an open 
d-electron shell. Brooks’ OP-term takes the form —Bm^{lz)m,rni 5i2- This term 
describes a shift in energy for an orbital with quantum numbers I = 2. mi and 
ms that is proportional to the average orbital angular momentum (4)m, for the 
TUs-spin subsystem and the so-called Racah parameters Bm, [119] that in turn 
can be represented by the Coulomb integrals and . An operator that 
corresponds to this energy shift is given by 

'HZ = -B^^Ar){LUj.Si2, (17) 

with 
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where Pdm, describes the average charge density of a d-electron with spin cha- 
racter rUs. Obviously the operator has the form expected within CDFT 

for rotational symmetry [105]. This is emphasized by introducing the vector 
potential function = — i?°^(r)(4) that leads to the Dirac equation: 



yca • V -I- f3mc^ -1- (r) -|- /3<t • Beff(r) + fil^ 



iFj(r) = ei'P^{r) 



(19) 



For a further discussion of the connection of this equation with CDFT see below. 

In addition to the OP-fornialisni several alternative schemes have been sug- 
gested in the past to account within a relativistic band structure calculation 
for correlation effects not incorporated within the local approximation to SDFT 
(LSDA). For example the LDA+U-scheme has been applied to the compound 
CeSb [56], a system that has a maximum Kerr-rotation angle of 90° [120]. Si- 
milar experience has been made for other f-electron systems. Nevertheless, one 
should point out that by applying the LDA+U-scheme one leaves the framework 
of DFT. This does not apply to the SIC (self-interaction correction) formalism 
[96], for which a proper relativistic formulation has been worked out recently 
[121,122] and applied to magnetic solids [121]. 

From the above presentation it is obvious that relativistic effects influence 
the electronic structure in a twofold way. On the one hand side, one has the 
influence on the electronic kinetics, that is accounted for by working with the 
Dirac-formalism. On the other hand, relativity influences the electron-electron 
interaction via the retardation effect, the Breit-interaction and so on, leading 
to quite pronounced corrections for the exchange and correlation energy E^c 
compared to its non-relativistic counterpart. This has been studied in detail for 
the paramagnetic and the spin-polarized case for example by MacDonald and 
Vosko [63,98], Rajagopal and coworkers [100,102,103] and Engel and coworkers 
[123]. Until now, however, only very few investigations have been done on the 
importance of these corrections [19,124-126]. Nevertheless, one may conclude 
from these few studies that the absolute magnitude of total energies as well as the 
binding energies of tightly bound core states is affected in a rather appreciable 
way. However, for properties like the equilibrium lattice parameter or even for 
magnetic properties no pronounced changes have to be expected. For this reason, 
the use of parameterizations derived within non-relativistic SDFT seems to be 
well justified. 



2.2 Multiple scattering formalism 

Solution of the Single Site Dirac Equation The first step to solve one 
of the above Dirac-equations for a solid using multiple scattering formalism is 
to find the solutions to the corresponding Dirac-equation for an isolated po- 
tential well. For that purpose Strange ct al. [68] investigated the associated 
Lippmann-Schwinger-equation and derived a set of radial differential equati- 
ons for the single-site solutions. An alternative scheme has been used by other 
authors [66,67] dealing with the problem by writing as a first step the single- 
site Dirac equation in spherical coordinates [7]. For the spin-polarized (SDFT) 
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case this leads to (for simplicity it is assumed in the following that one has 
-B(r) = B{r)e^ and atomic Rydberg units will be used throughout): 



iysUrC 




---K 
r r 



V 



+ (/? — 1) — E 



= 0 



(20) 



Here the spin-orbit operator K is defined by 

K = I + 1) 

and the matrices 75 and are given by: 




and 



(21) 



(22) 



ffr = ^ ■ O' , (23) 

with <T the vector of the 4x4 Pauli matrices [7]. To find solutions to Eq. (20) 
one makes the ansatz: 



A 



(24) 



where the partial waves are chosen to have the same form as the linearly 
independent solutions for a spherical symmetric potential: 



i^A{r,E) 



( 9K{r,E)xA{r) \ 
\if^{r,E)x-A{r) ) 



(25) 



Here the large and small components are composed of the radial wave functions 
gK.{r,E) and f,^{r,E) and the spin-angular functions: 

XA{r)= ^ , (26) 

m,=±l/2 



with the Clebsch-Gordon coefficients C{l^j-, rrii, m^), the complex spherical har- 
monics y)™' and the Pauli-spinors Xm^ ■ The spin-orbit and magnetic quan- 
tum numbers k and /r, respectively, have been combined to H = (k, fj.) and 
—A = ( — respectively. The spin-angular functions X/i('^) are simultaneous 

eigenfunctions of the operators jz and K with j = 1 + |cr. The corresponding 
eigenvalues are j{j + 1 ), /i and —k and are connected by the following relations: 



_ f — / — 1 for j = Z + 1/2 

^ I ^ ~ 1/2 

J = l«:| - 1/2 
-j < M < +J 

i =i-s^, 



(27) 

(28) 

(29) 

(30) 
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where = k/\k\ is the sign of k and I is the orbital angular momentum quantum 
number belonging to X-a- 

Inserting the ansatz in Eq. (24) into the single-site Dirac-equation (20) and 
integrating over the angles leads to the following set of radial differential equa- 
tions: 



Pa. = —Pa. 



E-V 



+ 1 



Qau + '^{x-a\^7.\x-A')Qa’u 



Q'a. = -Qa. -[E -V] Pa. + B ^{xaW2.\xa')Pa'. , 



(31) 

(32) 



where the usual notation Pa. = rgA. and Qa. = erf a. has been used. The 
coupling coefficients occurring here are given by: 



{xaWzIxa') = G{K,K,g)5^^> (33) 

= I foi' k = ~k' -1 . (34) 

[ 0 otherwise 

Because of the properties of G(«;, n' , g) only partial waves for the same g get 
coupled; i.e. g is still a good quantum number. In addition, one can see that for 
the orbital angular momentum quantum numbers I and V of two coupled partial 
waves one has the restriction l — l' = 0, ±2, . . . , i.e. only waves of the same parity 
are coupled. Nevertheless, this still implies that an infinite number of partial 
waves are coupled. In practice, however, all coupling terms for which l — l' = ±2 
are ignored. A justification for this restriction has been given by Feder et al. 
[67] and Cortona et al. [127]. Results of numerical studies furthermore justify 
this simplification [128,129]. Altogether, this re.stricts the number of terms in 
Eqs. (31) and (32) to 2 if |/uj < j. For the case g = j, there is no coupling at all; 
i.e. the solutions have pure spin-angular character A. 

The procedure sketched above to derive, starting from the SDFT-Dirac equa- 
tion, the corresponding set of coupled radial differential equations can be used 
straightforwardly for more complex situations. In the case of the formally rather 
simple Brooks’ OP-formalism one has the additional term (see Eq. (17)) 
leading to the radial equations: 



P' 

^A. 



Q'a. 



~Pa. 

r 



E-V 



+ 1 



B 



Qa. + '^{x-a\(^7.\x-a')Qa', 



— 2 X^(X-n|A°P4|x-n')Qyi' 

^ A' 

-Qa. - [E-V] Pa. V B'^{xaWt.\xa')Pa'. 



-Y.{XA\A°^k\XA')PA' ■ 



(35) 



(36) 




202 H. Ebert 



Formally, the term can be seen to represent the coupling of the electronic 
orbital current to a vector potential A. A more general form of such a vector 
potential is encountered for example when one is including the Breit interaction 
in the Dirac equation. To deal with such a situation, it is most convenient to 
represent the vectors within the scalar product a - A (see Eq. (1)) using spherical 
coordinates and to expand the spatial dependency of the components of the 
vector potential A in terms of complex spherical harmonics: 

a • A = ■ (37) 

m LM 



When added to the SDFT-Dirac-Hamiltonian this leads to the following radial 
equations [112]: 



P'a. 



--Pa. 

r 



E-V 



Qau + ^ '^{X-aWz\X-A')Q A' v 



(38) 



--E E A'^m^X-AY, 

A' LMm 

Q'a. = jQau -[E-V] Pau + bY,{xaW.\xa')Pa'. 



L ^rn 



Xa')Pa 



(39) 



-fE E AAM{XA\Yl^CJr^\X-A')QA' 



A' LMm 



The Dirac equation Eq. (14) set up within the framework of CDFT can 
be treated in a completely analogous way resulting in a similar set of radial 
differential equations for large and small component wave functions. 



Because of the high symmetry of the orbital polarization Hamiltonian T-LAx 
coupling between partial waves beyond that caused by the spin-dependent part of 
the SDFT-Dirac-Hamiltonian is introduced. This holds also if a vector potential 
term is added to the Hamiltonian that stems from the Breit-interaction or from 
the CDFT-formalism, as long as one imposes for this rotational symmetry, with 
the symmetry axes coinciding with the direction of the magnetization. 

The problem of deriving a full-potential (FP) version of the KKR-formalism 
has been discussed in a rather controversial way during the last decades. Now, 
it is generally accepted that the scheme proposed among others by Dederichs, 
Zeller and coworkers [130,131] supplies a sound basis for FP-KKR band struc- 
ture calculations. This implies that in a hrst step space is subdivided into non- 
overlapping, spacc-hlliiig polyhedra usually obtained by means of the Wigner- 
Seitz-construction. The shape of these Wigner-Seitz cells is represented by the 
so called shape functions ©^(r) [132] with 

0 ws(^’) = ^dL{r)yL{r) , 

L 



(40) 
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where the step function ©ws is 1 for r within the cell and 0 otherwise. The 
functions are real spherical harmonics with L standing for {l.m). In addi- 
tion one defines the radius Ver of the smallest circumscribed sphere, for which 
G’ws('*') = 0 for r > Ter- Multiplying the potential V of the extended system in 
Eq. (9) with the function f^ws('^) centered at an atomic site n leads to the single 
site problem. To solve the corresponding single site Dirac equation the same an- 
satz as given in Eqs. (24) and (25) is made. This leads now to the coupled radial 
Dirac equations: 



^ + 1 Qa - 

^ A' 

Q'y^ = -Qa^EPa + J2 ^aa'Pa' ■ 



P' - --Pa 



(41) 

(42) 



Here the underlying Dirac-Hamiltonian has been restricted to the SDFT-form 
with the corresponding potential matrix elements defined by 

= (x±A|V;fr ± ctB\x±a') ■ (43) 



These are straightforwardly evaluated by expanding the potential into real sphe- 
rical harmonics: 



V{r) 


= J2^Ar)yL{r) 

L 


(44) 


B{r) 


= Y.BAr)yAr) 

L 


(45) 


with B{r) 


= B{r)B . 





Here it has been assumed that B (r) points everywhere along the same direction 
B. In the following applications B will be oriented along the z-axis. However, 
these are no necessary restrictions for the formalism; i.e. treatment of other 
orientations or non-collinear magnetic states can be straightforwardly accounted 
for. 

Compared to a muflin-tin or atomic sphere approximation potential con- 
struction inclusion of non-spherical terms in V and B obviously leads to further 
coupling. In practice, however, the number of coupled partial waves is restricted 
to 2 (/max + 1)^ by fixing an upper limit /max for the angular momentum expan- 
sion of the wave function in Eq. (24). For example, for /max = 2 one may have up 
to 18 partial waves coupled; i.e., one has to solve up to 36 coupled equations for 
the functions Pa and Qa- However, for a cubic system with B = z and /max = 2 
one has at most 3 partial waves coupled due to the high symmetry of the system. 



Single-site t-matrix and Normalization of the Wave Functions Working 
with one of the above versions of the Dirac equation and using the corresponding 
coupled radial differential equations a set of 2(/max + 1)^ linearly independent 
regular solutions <^a can be created by initializing the outward integration with 
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a selected spin-angular character A dominating close to the nucleus; i.c. one 
demands that: 

^A(r,E) = Y,^a'a{v,E) ^-^$AA{r,E) . (46) 

yl' 



After having solved all systems of coupled equations for the wave functions 
^A one gets the corresponding single site t-matrix by introducing the auxiliary 
matrices a and b [133-135]: 

aAA'{E) = -ipr‘^[h^{pr),^AA'{r,E)]r (47) 

bAA'{E) = ipr^[h^{pr),<pAA’{r,E)]r . (48) 

Here p = y^E{l + E/c^) is the relativistic momentum [7] and [. . . ]r denotes the 
relativistic form of the Wronskian [135]: 

'P 

[h\,4iAA']r = hf cfAA' - ^ ^ j 

The functions arc the relativistic version of the Hankel functions of the 
first and second kind [7]: 



h^{pr) 



l+E/d^ f hf{pr)xA{r) \ 

\Wkhf{pr)xA{r)) 



(50) 



Evaluating all functions in Eqs. (47) - (48) at rp = 'Cmt, tws or Tcv, resp., i.c. 
the mufRn-tin, the Wigner-Seitz or the critical radius depending on whether one 
is using the muffin-tin, the ASA- or full-potential mode, one finally has [135]: 



t{E) = ^^{a{E)-b{E))h-\E) . (51) 

By a superposition of the wave functions $a according to the boundary condi- 
tions 



ZA{r,E) = Y,ZA'A{r,E) 

A' 

- ^Ph\{r,E) (52) 

A' 

one gets an alternative set of linearly independent regular solutions Z a to the 
single site Dirac equation. These functions are normalized in analogy to non- 
relativistic multiple scattering theory according to the convention of Faulkner 
and Stocks [10] and allow straightforwardly to sot up the electronic Green’s 
function (see below). The additionally needed irregular solutions 3a are fixed by 
the boundary condition 

JA{r,EY^ u{r.E) (53) 



and are obtained just by inward integration. The functions jA occurring in Eqs. 
(52) - (53) are the relativistic counterpart to the spherical Bessel functions 
defined in analogy to Eq. (50) for [7]. 
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Manipulation of the SDFT-Dirac-Hamiltonian Dealing with relativistic 
effects by adding corresponding corrections to the Schrddinger equation allows 
one straightforwardly to investigate the role of these corrections individually. 
For the fully relativistic Dirac formalism sketched above this is obviously not 
the case. The only way to monitor the importance of all relativistic effects in 
a consistent way is to manipulate them simultaneously by scaling the speed of 
light c. To allow in spite of this for a separate investigation of the role of scalar- 
relativistic effects and the spin-orbit coupling an elimination scheme has been 
applied recently to the SDFT-Dirac Hamiltonian. 

Using this procedure an equation for the largo component wave function 
(!>{r,E) can be introduced that is still exact [137,138]: 



1 d 2 9 

r2 Qy, Qy. 



:^-SaT + SaBg, 



f A_1 

S'yi I dr r 



j 4>{i',E) =0. 



Here the abbreviations 



(54) 



T = E-V (55) 

E -V B 

= f 1 + ^(x-nkz|x-yi) (56) 

have been used, where Sa would be identical to 1 in the non-relativistic limit 
which is obtained for c — > oo. 

For the wave function (j){r,E) the ansatz 



<P{r,E) = J2^A{r,E) = 9A{r, E)xA{r) , (57) 

A A 

is made in accordance with the adopted T-representation. Inserting this ansatz 
into the wave equation (54) leads to the following second order radial differential 
equation: 

^A = Pa “ TPa + ^ Baa’ Pa' 

Sa dr r 



Pa+ 

Sat ^ 



(58) 



with the spin-orbit coupling operator K replaced using the operator 

i = k ~l = CT - I . 



(59) 



Inserting the proper values for the corresponding angular matrix elements 

Ua' = {Xa\K - 1|XA') = (-K - Pj^AA' , (60) 
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the exact second order differential equation for the major component is reco- 
vered. 

Replacing the spin-orbit coupling operator K in Eqs. (59) and (60) by its 
scaled counterpart 



Kx = xa ■ I (61) 

with the associated effective spin-orbit quantum number 

Kx = -1 X {I + k) , (62) 

allows one to scale the strength of the spin-orbit coupling separately. Obviously, 
setting X = \ nothing changes at all, while for a: = 0 one gets Kx = —1. This 
is just the value of the spin-orbit quantum number for s-states where there is 
no spin-orbit coupling. Therefore, replacing n in Eq. (60) by Kx switches the 
spin-orbit coupling off for any partial wave if x = 0 and reduces or increases the 
spin-orbit coupling strength else. 

To solve the resulting second order differential equation Eq. (58) for the wave 
functions Pa{x, E) the auxiliary function Qa{x: E) is introduced by the definition 




1 

^ ■ 



(63) 



This leads after some simple transformations to a coupled set of hrst order 
differential equations; 



P'a 



-—Pa + SaQa 



(64) 



Q'a = —Qa-TPa + Y. 

^ A' 



I (I -h 1) — Kx{Kx + 1 ) 




(65) 



Apart from allowing one to manipulate the strength of the spin-orbit cou- 
pling, Eq. (58) also permits to modify the form of the spin-orbit coupling ope- 
rator. For this purpose ^ is splited according to 



^ = a • I 

— 4“ (^x^x T ^y^y) (66) 

= Czz + ^xy 



into two parts. The first term, ^^z, gives rise only to a splitting of levels with 
different quantum numbers m;. Because no mixing of states with different spin 
character is introduced that way, rrig is left as a good quantum number. In 
contrast to this, the second term, ^xy, gives rise to a hybridization of different 
spin states while no obvious splitting of m/-levels is caused by it. Because the 
two parts of ^ have quite different consequences it is interesting to investigate 
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their effect separately by replacing ^ in Eq. (58) cither by or by ^xy The 
corresponding angular matrix elements to be inserted are 



(x/l l?zz Ix/l' ) — 



(X/l|?xyjX/l') — 






(2i + l) 






+pt/l 



-m 

1 
2 



^ Jk, 21+1 



for K ^ k' 
for K = k' 

for K ^ k' 
for K = k' 



(67) 



(68) 



A solution of the resulting second order differential equation for the two different 
cases can again be achieved by introducing the auxiliary function Q+ in Eq. (63). 
This leads to the following sets of coupled first order differential equations: 



P+ ~ Pa + SaQa 



(69) 



Q'a ~ ~ Qa ~ TPa + 'y ^ Baa' Pa' 

A' 

' + ^)Pa + ^ ^aa' Pa' 



Sa r 



(70) 



with = (xaIIzzIxa') or = (X/ilIxy Ix/i'), respectively. 

The final coupled radial equations obtained for the two manipulation schemes 
sketched above differ only with respect to the last term in Eqs. (65) and (70), 
respectively, from the original equation (31) for their small component wave 
function corresponding to the proper SDFT-Dirac-Hamiltonian. Implementation 
of the two manipulation schemes therefore requires only minor modifications in 
the corresponding programs. 

However, one has to keep in mind that Q/i(r, E) defined by Eq. (63) has not 
the meaning of a small component occurring within the bi-spinor formalism. For 
this reason the boundary conditions to match the wave functions to solutions 
outside the sphere boundary have to be specified by Pa{t,E) alone [57]. To 
set up the corresponding single site t-matrix tAA'{E) used within the KKR- 
formalism one therefore has to replace the relativistic Wronskian (see Ref. [135]) 
by its standard form g +{'<': E) ^9 'a{'^-. E)ji(r, E) with ji{r, E) the spherical 

Bessel function. Furthermore, one has to note that for the evaluation of the 
matrix elements of any operator it has to be transformed in such a way that no 
coupling of large and small component occurs. This applies, for example, to the 
operator a • A that describe the interaction of electrons with the vector potential 
A. In this case, for example, the V • A-form of the matrix elements can be used 
[42,139]. 



Green’s function The problem of setting up the electronic Green’s function 
G{r,r'E) for a solid on the basis of relativistic multiple scattering theory for 
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arbitrary scalar and vector potentials has been investigated in great detail by 
Tamura [79] . In analogy to the non-relativistic formalism of Faulkner and Stocks 
[10] the Green’s function G{r,r ' , E) can be written as: 

G(r,r',E) = Z^ir, E)T2i{E)Z%^r ' , E) 

AA' 

-J2[ZAr,E)jr{r',E)0{r'-r) 

A 

+J2{r, E)Z'A {r E)0{r - r') ] W (71) 

for r (r ') within the cell n (n'). Here the quantity t2^i(E) is the so-called scat- 
tering path operator [140] that represents all multiple scattering events in a 
many-atom system in a consistent way (see below). The wave functions and 
are the properly normalized regular and irregular solutions of the correspon- 
ding single site problem for site n (see above). 

The most important technical point to note is that in Eq. (71) the sign ” x” 
indicates that the wave functions Z^ and are the left-hand side regular and 
irregular solutions of the corresponding modihed Dirac equation [79]. Fortuna- 
tely, for the SDFT-Dirac-Hamiltonian these are obtained from the same radial 
differential equations as the conventional right-hand side solutions Za and Jn; 
i.e. from Eqs. (41)-(42) with the potential matrix elements replaced by V^'A- 
For highly symmetric systems one may have the situation that = V^,^. In 
this case Z^ and are obtained from Z and J by simple complex conjugation 
and transposition: 

Za{^,E) = Y^{gA’A{r,E)x\,{r); -i f a' a{t , E)x\,) (72) 

A' 

and 

JA^^E) = '^{gA'A{r,E)xA,{r);-ifA'A{r,E)xA,) , (73) 

A' 



since left and right hand side solutions are identical with respect to their ra- 
dial parts. This applies in particular to the single site problem with spherically 
symmetric potential terms V and B, but also to cubic systems with the magne- 
tization along the z-axis, as investigated here. Fortunately, this is still true if the 
OP-term is included in the Dirac equation (19) because here the relation (Eq. 
(11) in [79]) for the vector potential corresponding to the OP-potential term in 
Eq. (19) holds. 



Calculation of the Scattering Path Operator t ^ a ' The scattering path 
operator - introduced by Gyorffy and Stott [140] - transfers a wave with 
spin-angular character A' coming in at site n' into a wave outgoing from site n 
with character A and with all possible scattering events that may take place in 
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between accounted for. According to this definition it has to fulfill the following 
self-consistency condition: 

T =t dnn'-\-i T , (74j 

k^n 

where all quantities are energy dependent and the underline denotes matrices 
with their elements labeled by yl = («:,//). Here the single site t-matrix t” is 
fixed by the solutions to the single-site Dirac equation for site n. Furthermore, 
G"" is the relativistic real space Green’s function or structure constants matrix 
that represents the propagation of a free electron between sites n and n' [133]. It 
is related to its non-relativistic counterparts G^, = by the simple 

relation [80]: 

G'A" , = (1 + E/c^) S^cGTc'Sc'A' , (75) 

cc 

where L and L stand as usual for the sets (I, mi. ms) and {I, mi), resp., of non- 
relativistic quantum numbers. The matrix elements of the unitary transforma- 
tion matrix 5 occurring in Eq. (75) are given by the Clebsch Gordon coefficients 
ms, ms) [7]. 

For many situations it is often sufficient to consider a system consisting of 
only a finite number of atoms, as for example within the local interaction zone 
(LIZ) formalism [78] or EXAFS-theory [76,77]. In that case Eq. (74) can be 
solved by inverting the corresponding real-space KKR-matrix [77] : 

T=[m — G]~^ , (76) 

where the double underline indicates super-matrices with the elements being 
labeled by the site indices of the cluster. The elements themselves are matrices 
labeled by A as for example (G)”" = G”" with (G"” )aa' = G^,. The matrix 
m in Eq. (76) is site-diagonal and has the inverse of the single site t-matrix G 
as its diagonal elements; i.e. (m)”" = = (G)^^^rm'- 

Alternatively one may instert Eq. (74) repeatedly into itself to arrive at the 
scattering path expansion 

^nn j.n _|_ j.n ^ ^ 

k^n 

k^n l^k 

for the site-diagonal scattering path operator t”". Compared to the matrix in- 
version the scattering path expansion technique is highly efficient. On the other 
hand, one may encounter convergence problems using it. To avoid these problems 
several alternative and efficient schemes have been suggested in the literature 
[141,142] that have not been applied so far for relativistic calculations. 
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For ordered infinite systems Eq. (74) can be solved exactly by means of 
Fourier-transformation. For one atom per unit cell, the term is obtained 
from the Brillouin-zone integral 

rTA'iE) = 7^ / d^k[t-HE) - G{k, . (78) 

^BZ 

Here Rn{n') denotes the lattice vector for site n(n') and the relativistic k- 
dependent structure constant matrix G(fc, E) is connected to its non-relativistic 
counterpart in analogy to Eq. (75) for the real space formulation. 

As usual, group theory can be exploited to restrict the integration in Eq. (78) 
to the irreducible part of the Brillouin-zone, that depends on the orientation of 
the magnetization [143,144]. For cubic systems, the site-diagonal case n = n' has 
been dealt with in detail by Hormandingcr and Weinberger [145], while the site- 
off-diagonal case n ^ n' has been worked out by Zecha and Kornherr [146,147]. 

To deal with the electronic structure of surfaces within the framework of the 
spin-polarized relativistic KKR-formalism, the standard layer techniques used for 
LEED and photoemission investigations [148] have been generalized by several 
authors [90,149]. As an alternative to this, Szunyogh and coworkers introduced 
the TB-version of the KKR-method [74,150,75]. To invert the emerging layer-, 
fc||- and A-indexed KKR-matrix, that has in principle an infinite number of rows 
and columns, these authors applied techniques that are also used within the TB- 
LMTO-Green’s function formalism [95]. Finally, the scheme to deal with surfaces 
and layered systems developed by Dederichs and coworkers [151], that represents 
the vacuum region by layers of empty atomic sites, has been generalized recently 
by Huhne and Nonas [152,153]. 



Treatment of Disordered Alloys One of the appealing features of the mul- 
tiple scattering formalism described above is that it can be applied straightfor- 
wardly to deal with disordered alloys. Within the Coherent Potential Approxi- 
mation (CPA) [154] alloy theory the configurationally averaged properties of a 
disordered alloy are represented by a hypothetical ordered CPA-medium, which 
in turn may be described by a corresponding scattering path operator 
This operator is determined by the CPA-condition: 



xaT 



xbL 



_rm, CPA 



(79) 



where the matrices are dehned with respect to the index A and the binary alloy 
system has components A and B at relative concentrations xa and xb, respec- 
tively. The above equation represents the requirement that the concentration- 
weighted sum of the component-projected scattering path operators r””>“ should 
be identical to that of the CPA-medium; i.e. embedding an A- or a B-atom into 
the CPA-medium should not cause any additional scattering. 

The describes the scattering properties of an a-atom embedded in the 

CPA-medium, i.e. of a substitutional impurity and is given by the expression 



^nn.a ^nn.CPA 
T ’ = r 



T I /i — 1 / — I \ _nn,CPA' 

1 + (fa - fcPA) T J 



-1 



(80) 
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where and t^PA the single site t-matrices of the component a and of the 
CPA-medium, respectively. The later quantity is connected with the CPA scat- 
tering path operator by an equation analogous to Eq. (78) or (76), respectively. 
To obtain the quantities and £^j, given concentrations one 

must solve Eqs. (79), (80) and (78) or (76), respectively, iteratively. 

3 Applications 

3.1 Basic Electronic Properties 

Dispersion relations The impact of the spin-orbit coupling on the dispersion 
relations Ej(k) of spin polarized relativistic solids have been investigated by 
several authors in the past [71,138,155]. Corresponding results for fcc-Ni are 
shown in Fig. 1 for the magnetization M along the [OOlj-axis and the wave- 
vector k along the [lOOj-axis [138]. As one notes, spin-orbit coupling gives rise to 
a lifting of degeneracies (c.g. at A and B in Fig. 1, left) and causes hybridization 
or mixing of bands (e.g. at C, D, E and F) that simply cross within a non- or 
scalar relativistic treatment. In addition, one finds that for Bloch states I'Fjfc) the 
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Fig. 1. Dispersion relation Ej{k) of fcc-Ni for the magnetization M and the wave- 
vector k along the [001]- and [100]-axis, respectively. The panels show from left to the 
right results based on the full Dirac equation and those obtained keeping the zz- and 
xy-terms in Eq. (66). 



expectation value is not restricted to ±1 (see e.g. Ref. [65,71]), i.e. 

spin is no more a good quantum number. However, remarkable deviations from 
the values ±1 occur only in the region where bands cross if spin-orbit coupling 
is neglected. For this reason it is justified to attach the labels | and t to the 
bands to indicate their dominant spin character for a certain range of k. 

Keeping only the ^zz-part of the spin-orbit interaction the most important 
consequence is that now all states have pure spin character that cannot change if 
one goes along a certain band. However, this does not rule out the hybridization 
of bands induced by ^zz- As one can sec from the middle panel of Fig. 1 hybri- 
dization takes place at E and F. On the other hand, no hybridization is found 
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at C and D, where now bands of different spin character cross. Furthermore one 
notes that the splitting of the bands, e.g. at A, B, E and F caused by the ^^z-part 
is quite comparable to that due to the full spin-orbit interaction. 

Concerning the hybridization, the situation is more or less opposite to the 
situation for if the Cxypart is used. The right panel of Fig. 1 demonstrates 
that there is now a pronounced hybridization of bands of different spin character 
(C and D) - just as for the full spin-orbit interaction. While hybridization is also 
present at E and F, it is much less pronounced than for <^zz- Surprisingly, the 
splitting of the bands caused by ^xy, while being in general smaller than for 
^zz, is still quite appreciable. In spite of this, both parts have a rather different 
importance for many spin-orbit induced properties, as it will be demonstrated 
below. 

The influence of the spin-orbit coupling on the dispersion relation of ordered 
spin-polarized solids shown in Fig. 1 can also be demonstrated for disordered 
ones. Instead via the dispersion relation, the band structure is represented in 
the later case by means of the Bloch-spectral function A-Q{k,EY) that can be 
viewed as a fe-dependent density of states (DOS) function [156]. Due to the 
chemical disorder, A^{k,E-p) for a given energy is in general spread out in k- 
space, implying that the wave vector k is not a good quantum number. For an 
ordered system, on the other hand, the smearing out in fc-space does not occur 
and A'Q{k^E'p) can be written as a superposition of d-functions 5{E — Ej{k)) 
and the conventional dispersion relations are recovered. 

Fig. 2 shows results for spin-projected Bloch-spectral function A^ik, Ep) 
obtained for fcc-Feo. 2 Nio ,8 for the Fermi energy Ep, the wave vector k in the 
(OlO)-plane and the magnetization M along the [001]-direction [157]. As one 
notes, disorder has quite a different impact on the majority and minority spin 
Bloch-spectral functions. Nevertheless, there is a well-defined Fermi surface for 
both of them that - due to the specific composition - is very similar to that of 
pure Ni [26,158]. Comparing the details of the Bloch-spectral function for the 
two spin sub-systems a hybridization is recognized. This especially holds for the 
single majority sheet centered at the E -point that has a pronounced minority 
spin admixture. Because the majority spin states primarily carry the electric 
current [159] and because the spin hybridization leads effectively to a short-cut 
it should have a great influence on the electric resistivity. In fact, calculations 
of the isotropic residual resistivity of magnetic alloys using the SPR-KKR-CPA 
give always higher values than calculations that neglect spin-orbit coupling and 
make use of the so-called two-current model [160,161]. 

In addition to the spin hybridization, one notes a small anisotropy of the 
Bloch spectral function; i.e. it depends on the relative orientation of the wave 
vector k and the magnetization M. This applies not only to the location of 
A-B^k.E) in fe-space, but also for the width of the Bloch spectral function. 
The former property is also found for the dispersion relation of ordered sy- 
stems and clearly indicates the lowered symmetry of the system compared to a 
paramagnetic state. For the situation considered in Figs. 1 and 2 the symmetry 
is effectively tetragonal instead of being cubic. Changing the orientation of the 
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Fig. 2. Gray-scale representation of the Bloch spectral function AB{k,Ep) for fcc- 
Feo. 2 Nio ,8 for the energy fixed to the Fermi energy Ep and the magnetization M || [001]. 
The wave vector k is in the (OlO)-plane; i.e. the horizontal axis gives the component 
of k perpendicular to M, while the vertical axis gives that parallel to M. The white 
background corresponds to Apik, E) = 0, while the black regions represent Apik, E) > 
50 a.u.; i.e. the cusps of Apik, E) have been cut for a more resolved representation. The 
left and right parts give the Bloch spectral function decomposed into their minority 
and majority spin part, respectively. 





magnetization M will in general change the symmetry but also the electronic 
structure itself. The corresponding changes in the total energy give rise to the 
magneto-striction [162,163] and the magneto-crystalline anisotropy [164,165]. 
The conventional approach to evaluate magneto-crystalline anisotropy energies 
is to apply the force theorem [166], that allows to approximate the energy dif- 
ference for two orientations of the magnetization as the energy difference of the 
corresponding single particle energies. This means that the small changes found 
for the dispersion relation when the magnetization is rotated [155,158] are the 
microscopic origin of the magneto-crystalline anisotropy. In practice the mentio- 
ned energy differences are calculated by integrating the various energy-weighted 
density of states curves or equivalently the integral DOS curves up to a fixed 
Fermi energy. On the basis of this procedure Ujfalussy et al. performed for multi- 
layer systems a layer-wise decomposition of the magneto-crystalline anisotropy 
energy by using corresponding layer projected DOS functions [167]. Of course, an 
analogous composition can be made for any multi-component system. Using the 
KKR-fornialism an elegant way to perform the above mentioned energy-integral 
for the integrated DOS function is to make use of Lloyd’s formula [168]. This 
has recently been used for a calculation of the magneto-crystalline anisotropy 
energy of disordered fcc-Coa;Pti_a; alloys by Razee et al. [169] who also worked 
out corrections to the expression based on the force theorem. 

Finally, it should be emphasized that all calculations of the magneto-crystalline 
anisotropy energy done so far on the basis of the force theorem or alternative 
schemes [164,165] account only for the spin-orbit coupling as its microscopic 
source. The Breit interaction, that gives rise to contributions in the same order 
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of magnitude, is usually accounted for only in a second step by calculating the 
so-called shape anisotropy energy [165,170,171]. 

Charge and Orbital Current Density Distribution For a paramagnetic 
solid time reversal symmetry implies that spin-orbit coupling viewed as a pertur- 
bation leads for states with quantum numbers (mi, mg) to the same changes as 
for (—mi, —ms). As a consequence the spatial symmetry of the charge distribu- 
tion is not affected and no orbital current is induced. For a spin-polarized solid, 
on the other hand, this docs not hold anymore; i.e. states with quantum numbers 
(mi, ms) and (—mi, —mg) are affected by the inclusion of the spin-orbit coupling 
in a different manner because of the exchange splitting. As a consequence the 
charge distribution will be rearranged according to the lowered symmetry of the 
system compared to its paramagnetic state. For a magnetic solid with a cubic 
lattice and the magnetization along the 2 ;-axis, for example, the effective symme- 
try is only tetragonal. Accordingly, self-consistent full-potential spin polarized 
relativistic calculations lead to non-cubic terms like p 20 and V 201 respectively, 
if for the charge density p and the potential V the conventional expansion into 
spherical harmonics is used [83] (see Eq. (44)). 

A further consequence of the presence of the spin-orbit coupling for a spin- 
polarized solid is that its orbital angular momentum is no more quenched. This 
corresponds to the occurrence of a finite paramagnetic orbital current density 
jp (the adjective paramagnetic can be omitted in the following because external 
magnetic fields are assumed to be absent; i.e. the physical and paramagnetic 
current densities are identical). 

Within the Green’s function formalism used here the current density jp can 
be obtained from the expression 

jp = Trace A f dE — \7]G(r,r' , E)\r=r' ■ (81) 

7T J i 

Corresponding results [83] obtained for the current density jp in bcc-Fe are 
shown in Fig. 3. Here the direction and magnitude of jp is represented by arrows 
for the (OOl)-planc with the z- and magnetization axes pointing upwards. At first 
sight the current density distribution seems to be rotational symmetric. However, 
a closer look reveals that it has in fact a lower symmetry. This is demonstrated 
in the right part of Fig. 3 which gives the radial component of jp within the 
(OOl)-plane. This component is about 2-3 orders of magnitude smaller than jp 
itself and has been scaled by a factor of approximately 350 with respect to the 
left part of Fig. 3. As one notes, there is only a four-fold symmetry axis along 
the z-axis. For the paramagnetic state the x- and y-axes as well as the diagonal 
axes in between would be twofold symmetry axes. Obviously, the corresponding 
symmetry operation C 2 is missing here because of the ferromagnetic state and the 
spin-orbit coupling accounted for. However, one can also clearly see from the right 
part of Fig. 3 that this symmetry operations combined with the time reversal 
operator T result in proper symmetry operations (TC 2 ±) for the ferromagnetic 
state [144]. 
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Fig. 3. Orbital current density jp for bcc-Fe in the (OOl)-plane (left). The right part 
gives the corresponding radial component scaled by a factor of around 350 with respect 
to the left part. For display jp has been weighted with r^. 

3.2 Orbital Magnetic Moments 

With the spiii-orbit-iiiduced orbital current density in magnetic solids there is 
obviously a finite orbital angular momentum density associated. For a rotational 
symmetric current density distribution, for example, one has circular currents 
implying the simple relationship [83] 

iPlz{r)) = , (82) 

where is the (^-component of jp that gives its magnitude along a closed 
circular loop. 

Connected with (/??z) there is of course a corresponding orbital magnetic 
moment /Xorb that can be obtained via Eq. (82) or directly from the conventional 
expression [24,83]: 

Morb = — — Traced J dE j d^r (3l^G{r,r,E) . (83) 

As Fig. 4 shows, the spin-orbit induced /Xorb contributes 5 to 10 % of the total 
magnetic moments of the elemental ferromagnets Fe, Co and Ni. Flowever, one 
also notes from this figure that the results obtained on the basis of plain SDFT 
are much too small compared with experiment in the case of Fe and Co. To 
cure this problem, that also occurs for /-electron systems. Brooks introduced 
the OP-formalism [113], that was originally restricted to fe-space band structure 
methods. Using the real-space formulation given above, one can see that it ef- 
fectively leads to a feed-back of the spin-orbit induced orbital current into the 
potential term of the Dirac equation (see Eq. (19)). Based on the corresponding 
spin- and orbital polarized relativistic (SOPR) KKR-formalism [118], one finds 
a strong enhancement of the orbital magnetic moment for Fe and Co leading to 
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Fig. 4. Orbital magnetic moments for bcc-Fe, fcc-Co and fcc-Ni. The various columns 
represent from left to right the experimental data (E) [172] and the theoretical data 
obtained by the plain SPR-KKR- (K), within CDFT (C) [109] as described in the text 
as well as the SOPR-KKR (O) ]118] including the OP-potential term. The last column 
labeled with L gives results obtained using the LMTO that account for spin-orbit 
coupling and the OP-term in the variational step [51,173]. 



a rather satisfying agreement with experiment (see Fig. 4). The spin magnetic 
moment, on the other hand, is hardly affected by inclusion of the OP-term. Fur- 
thermore, calculations done in the full-potential mode [83] clearly demonstrated 
that the OP-term does not include aspherical potential terms that would be 
accounted twice in a full-potential calculation, as it was sometimes suspected in 
the past. 

Apart from minor numerical differences, the results obtained with the SOPR- 
KKR are completely in line with those obtained before using the LMTO-method 
[51,173,174]. However, the latter approach accounts for spin-orbit coupling and 
the OP-term only in the variational step, while for the SOPR-KKR these are also 
included when calculating the wave functions and the corresponding single-side 
t-matrices. As a consequence the SOPR-KKR can straightforwardly be combined 
with the CPA to deal with disordered alloys. As an example for an application 
of the SOPR-KKR-CPA results for /iorb of bcc-Fea,Coi_a; are shown in Fig. 5 
[118]. 

In contrast to the investigations of Soderlind et al. [175] done using the LMTO 
together with the virtual crystal approximation (VGA) alloy theory [156] the 
SOPR-KKR-CPA supplies component-resolved results. As one can see in Fig. 5 
the enhancement of /Xorb for Fe and Co in bcc-Fea,Coi_a; are very similar to that 
found for the pure metals. Again this enhancement brings the average orbital 
magnetic moment for the alloy in very satisfying agreement with experiment. 

Because the OP-term is very similar in form to the operator representing 
spin-orbit coupling as a correction or perturbation, one may expect that it 
will not only affect the spin-orbit induced orbital magnetic moments but also 
any other quantity caused by spin-orbit coupling. This is in general indeed 
the case as it could be demonstrated by investigations on the spin-orbit indu- 
ced band-splittings [176], the orbital contributions to the hyperfine fields [118], 
the magneto-crystalline anisotropy [177], galvano-magnetic properties [178], the 
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Fig. 5. Orbital magnetic moments in bcc-Fea;Coi_x- The triangles pointing up- and 
downwards represent the theoretical moments of Fe and Co, respectively, while the 
concentration weighted sum is given by circles. Full and open symbols stand for results 
obtained with and without the OP-term included (SOPR- and SPR-KKR-CPA, resp.). 
Experimental data [172] for the average magnetic moment (right part) stemming from 
magnetomechanical and spectroscopic g-factors are given by full squares and diamonds. 



magneto-optical Kerr effect [179] and the magnetic dichroism in X-ray absorp- 
tion [180]. 

Using the OP-formalism, one obviously leaves the framework of density fun- 
ctional theory and arrives at a heuristic hybrid scheme. From a formal point of 
view CDFT therefore supplies a much more satisfying basis to deal with orbital 
magnetism. Results for /Torb of Fe, Co and Ni, that have been obtained using the 
relativistic version of Vignale and Rasolt’s CDFT-formalism, are given in Fig. 4 
[109]. Obviously using CDFT instead of plain SDFT leads indeed to an enhan- 
cement of /iorb for Fe and Co. Although this effect is found to be too small, one 
may expect that the remaining deviation from experiment will be reduced with 
improved parameterizations for the exchange-correlation potentials available. 

The basic CDFT-FIamiltonian in Eq. (14) does not rule out the existence 
of a finite orbital magnetic moment in the non-relativistic limit. With the help 
of model calculations, it could be demonstrated that this is not the case [109]. 
Starting a SCF-calculation with a finite spin-orbit induced orbital current den- 
sity and switching off the spin-orbit coupling during the SCF-cycle the orbital 
magnetic moment vanished. 

Using Vignale and Rasolt’s formulation of CDFT [104,105] , one has to use the 
CTirrent density in a spin-projected way. This leads to quite large contributions in 
the nucleus near region stemming from the core states, that essentially cancel if 
the spin contributions are summed up (see for example Fig. 3). However, for tran- 
sition metals the corresponding core contributions to the exchange-correlation 
potential lias not much overlap with the current density of the valence 

d-electrons. Because ^Uorb stems primarily from these, the core contribution to 
Axc,cr(j') can be neglected. For bcc-Fe the corresponding valence band part of 
the polar component of the spin-dependent exchange-correlation vector potential 
Axe, a {'I') is given in Fig. 6 (left part). Because the OP-term can be manipulated 
to represent a coupling to the electronic orbital degree of freedom or current. 
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Fig. 6. Left: Valence band part of the polar component of the spin-dependent 
exchange-correlation vector potential A^c,a{r,9) for bcc-Fe (in atomic units). The full 
and dashed lines give the potential for minority and majority character, respectively, 
for 9 = 7t/2. Right: The OP potential term for bcc-Fe as calculated by the self- 
consistent FP-OP-SPR-KKR. 

respectively, a corresponding vector potential function can be constructed 
within the OP-formalism [83]. However, one has to keep in mind that the phy- 
sical picture behind the OP-formalism is quite different from the CDFT as used 
here. While for the former case one tries to account in an approximate way for 
intra-atomic correlations, the vector potential occurring within CDFT is due to 
diamagnetic contributions to the exchange-correlation energy of the electron gas. 
Accordingly, it is not surprising that the resulting vector potential function (see 
Fig. 6) for the OP-formalism is quite different from that obtained within CDFT. 
In spite of this fundamental difference one finds the current density calcu- 
lated within the extended OP-formalism to be very similar to that calculated 
within the framework of CDFT [83], i.e. both differ only with respect to their 
absolute magnitude but not concerning their radial variation. For this reason, 
the OP-formalism, that is extremely simple to be implemented, may be used 
to study the influence of corrections to the exchange-correlation energy due to 
finite orbital currents as long as no better parameterizations for this have been 
derived within CDFT. 

3.3 Hyperfine Interaction 

Quadrupolar Hyperfine Interaction The nuclear quadrupolar hyperfine in- 
teraction, that can be investigated experimentally, for example, by means of 
Mofibauer spectroscopy or NMR, denotes the coTipling of the nuclear quadrupole 
moment Q with the electric held gradient (EFG) stemming from the surroun- 
ding electronic charge distribution. Because the nuclear quadrupolar hyperfine 
interaction reflects the local site-dependent symmetry of the charge and poten- 
tial distribution in a rather direct way, it provides a unique tool to investigate 
the consequence of the spin-orbit coupling for the symmetry of a spin-polarized 
cubic solid. 

The electric field gradient, that is only non-zero for a site-symmetry lower 
than cubic, can be calculated straightforwardly from the Coulomb part of the 
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electronic potential V. Expanding V into spherical harmonics (sec Eq. (44)) one 
has, for example, for the zz-component of the electric field gradient tensor [181]: 



^zz = eg = 2 liin V 2 o{r) . 

r ->0 



(84) 



As an alternative, the EFG can also be calculated from the corresponding non- 
spherical charge distribution term p 2 o{i^) within the central Wigner-Seitz cell 
and a Madelung contribution stemming from the multipolcs on the neighboring 
sites [130,146,181], 

For the case that the electric field gradient is caused by the low symmetry 
of the underlying lattice it was found in the past that it is often sufficient to 
perform self-consistent calculations for the charge density only in the spherical 
approximation and to determine the non-spherical charge density pimi (r) only in 
the final iteration (spherical approximation). In addition it was found from FP- 
SPR-KKR calculations on Fe that the spin-orbit induced electric field gradient 
stems nearly exclusively from the non-spherical charge density within the central 
cell containing the nucleus while the contribution of the surrounding can be 
ignored [146], Using these simplihcations together with the AS A- version of the 
SPR-KKR formalism the electric field gradient tensor component <p 2 z can be 
written as [139]: 
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with the angular matrix elements Baa' given by 



Baa' = 00) (87) 

47T 

^ C^‘C^PC{11'2; [p - ms)i-p + ms)) 



and the Clebsch Gordon coefficients {p — ms)ms) represented by the short 

hand notation C™” . 

This approach has been used to study the properties of 5d-transition metal 
impurities dissolved substitutionally in bcc-Fe [182], These impurity type calcu- 
lations have been done in the single-site approximation ignoring the distortion 
of Fe-atoms in the vicinity of the impurity. The resulting electric field gradient 
parameter q = ^zz/e is shown in Fig. 7 for the whole 5d-serics. First of all one 
notes that q is only around one order of magnitude smaller than that usually 
found for a lattice-induced EFG [187]. Taking into account that the existing ex- 
perimental data are scattering quite strongly and that measurements on single 
crystals with a definite relative orientation of the magnetization and the crystal 
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Fig. 7. EFG parameter q of 5d-transition metal- impurities in Fe. Theoretical results 
for the magnetization direction along the z-axis together with their decomposition into 
p- and d-electron contributions are given by full, dotted and dashed lines, respectively. 
Experimental data stem from powder as well as from single crystal measurements [183- 
186]. 



axis were done only in some few cases, agreement of the theoretical results with 
experiment is quite satisfying. In particular the pronounced dip of the EFG as a 
function of the ordering number Z for Ir seems to be confirmed by experiment. 
The variation found for g as a function of Z strongly resembles that obtained 
earlier for the spin-orbit induced contributions to the magnetic hyperfine field 
[188], which are predominantly of orbital origin. Earlier, more qualitative in- 
vestigations [189,190] assumed that this contribution is exclusively due to the 
d-electrons. However, it turned out that the p-clcctrons contribute to a simi- 
lar extent to the magnetic hyperfine field showing only a weak variation with 
Z [188]. A similar situation is encountered here for the EFG parameter q. As 
can be seen in Fig. 7 the p-electrons contribute in particular for the early 5d- 
transition metals, where they exceed the d-electron contribution. Nevertheless, 
one also notes that the variation of q with Z is primarily due to its d-electron 
part. 

Based on perturbation theory it was expected that the spin-orbit induced 
EFG should depend quadratically on the spin-orbit coupling strength [184]. 
Using the manipulation scheme described in section 2.2 this could be verified 
for the d-electron contribution, while for the p-electroiis a pronounced deviation 
from the quadratic dependency has been found. So far it has been assumed that 
the spin-orbit induced EFG stems predominantly from the spin-diagonal part 

of the spin-orbit coupling (see Eq. (66)). However, performing corresponding 
model calculations it was found that the spin-mixing part ^xy contributes to the 
same order of magnitude but with opposite sign. 

Because the EFG reflects the local symmetry one expects it to change with 
the orientation of the magnetization; i.e. it should monitor the fact that the 
spin-orbit induced anisotropy is anisotropic. This could indeed be demonstrated 
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recently by Secwald ct al. [46], who determined the EFG of Ir in bcc-Fe for 
the magnetization pointing along the [001]-, [111]- and [110]-directions. It is 
interesting to note that this type of anisotropy was too weak to be detected 
in the case of magneto-optical Kerr effect (MOKE) investigations on fcc-Co 
[191,192], 



Magnetic Hyperfine Interaction The magnetic hyperhne interaction repre- 
sents the interaction of the nuclear magnetic moment with the current density 
j of the surrounding electronic system. In its relativistic form the corresponding 
operator 71 hf is given by [7,193]: 

Tlhf = eca ■ pn X r/r^ . (88) 

Here only the static part of the hyperfine interaction is considered. Assuming 
the magnetization and quantization axis to point along the z-axis, only the part 
Tlhf.z oc az{fJ-n X J')z has to be accounted for. 

Within a non-relativistic theoretical description of the hyperfine interaction it 
is conventional to split the total hyperfine interaction operator into three distinct 
contributions: the Fermi-contact, the spin-dipolar and the orbital terms. While 
the first is relevant only for s-electrons, the other two are connected exclusively 
to non-s-electrons. Starting from a Gordon-decomposition of the electronic cur- 
rent, a corresponding decomposition of the hyperfine interaction operator Tlhf 
in Eq. (88) can also be made within relativistic theory [139,182,194]. For the 
orbital part one gets for example the expression: 

71hf.orb = 2MB/3Mnr|^_3|”;; , (89) 

where r„ is the nuclear radius. This expression already indicates that for the 
decomposition of the relativistic hyperfine interaction operator a nucleus of finite 
size has to be considered [182,194]. Furthermore one has to note that the various 
parts of Hhf are no more exclusively due to s- or non-s-electrons, respectively. 

For spontaneously magnetized solids the central hyperfine interaction para- 
meter is the hyperfine field Hhf. This quantity is determined by the expectation 
value of the static part of the hyperfine interaction operator: 

Bhf = (Tlhf,z)/^7n , (90) 

with 7 „ the nuclear gyromagnetic ratio. Representing the underlying electronic 
structure by means of the Green’s function formalism {'Hhf z) in turn is given by 
[24]: 



(Tlhf.z) 



— Traces J dE J d^r Hhf, zif) G{r,r, E) . 



(91) 



Dealing with this expression on a non-relativistic level one gets contributions 
to the hyperfine field Bhf only from the Fermi-contact and spin-dipolar terms 
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because the orbital magnetization density is quenched in the solid [109] (see 
above) . Contributions due to the spin-dipolar term are in general ignored because 
they arise only from a non-cubic electronic spin density distribution. For these 
reasons the standard approach to calculate hyperhne fields is to determine just 
its Fermi-contact contribution stemming from s-electrons. In contrast to this 
simple but conventional approach, the fully relativistic scheme given above leads 
to contributions to the hyperhne held from non-s-electrons as well. These are 
caused by the spin-orbit coupling and are non-negligible even for cubic systems 
[25]. 

The left part of Fig. 8 shows the total hyperhne helds of Fe in fcc-Fea;Pdi_a; 
together with a decomposition into contributions stemming from the core, va- 
lence and non-s-electrons. The experimental data available for Fe indicate that 





Fig. 8. Left: Hyperfine fields of Fe in fcc-Fea,Pdi_a;. In addition to the total field i?hf 
(total) the contributions of the core, valance and non-s-electrons are given separately. 
Available experimental data have been added. Right: Contributions Bvai of the d-like 
valence electrons to the hyperfine fields of Fe in fcc-Fea,Pdi_a,. The fields Sorb represent 
the corresponding orbital part. This has also been determined by the approximation 
due to Abragam and Pryce (AP, see Eq. (3.3)) [195]. 



the theoretical helds are too small in magnitude. Discrepancies similar to these 
have been found before for many other systems and have been ascribed to pro- 
blems in dealing with the core polarization contribution when the spin density 
functional theory is used on a local-density approximation (LDA) level [20,24]. 
However, improvements to the LDA, like the generalized gradient approximation 
(GGA) [196] or the self-interaction correction (SIG) [196,125] did not give much 
better results. Using the optimized effective potential (OEP), on the other hand, 
very satisfying results for the hyperhne helds of Fe, Co and Ni could be achieved 
recently [197]. 

Within a non-relativistic calculation of the hyperhne helds in Fea,Pdi_a; one 
would get only contributions due to the s-electrons via the Fermi-contact inter- 
action. Within a fully relativistic investigation this part is enhanced by about 
10% for Fe [25]. In addition one hnds quite appreciable contributions from non- 
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s-clcctroiis. These are iiidueed by the spin-orbit coupling and in general opposite 
to the normally dominating negative core polarization fields. 

The dominating part of the non-s-fields stemming from the valence band 
electrons is given once more in Fig. 8 (right part). For Fe this stems nearly 
exclusively from the d-electrons. With the proper relativistic decomposition of 
the hyperfine interaction in Eq. (88), the origin of these fields can be investigated 
in a detailed way. The corresponding fields i?orb,d obtained using the relativistic 
orbital hyperfine interaction operator (see Eq. (89)) have been added to Fig. 8. 
As one can see these fields differ only slightly from Bvai.d implying that the helds 
coming from d-electrons via the Fermi-contact and spin dipolar interaction arc in 
general negligible. This is also confirmed by an additional and direct calculation 
of these fields. For this reason it is quite well justified to call the spin-orbit 
induced hyperfine fields coming from non-s-electrons in a somewhat loose way 
orbital [25]. 

One of the most important consequences of the spin-orbit coupling for ma- 
gnetic solids is the presence of a spin-orbit induced orbital electronic current 
density that gives rise - according to Eq. (89) - to the orbital hyperfine fields 
but that causes also a corresponding orbital contribution /^orb to the total ma- 
gnetic moment. Because of their common physical origin one can expect the 
fields Borb and the moment /Uorb to be related via [195]: 

Borb,/ — 2/Xb {r ) t^orb,/ {1 ~ Pi ^) ■ 

As it can be seen in Fig. 8, this simple approximation works quite well justifying 
once more the designation orbital used above. 

3.4 Linear Response 

Static Magnetic Susceptibility and Knight Shift Using the Green’s fun- 
ction formalism for a description of the underlying electronic structure gives 
several important advantages when dealing with response quantities. In the case 
of the static magnetic susceptibility, for example, it is straightforward that way 
to deal with inhomogeneous systems. This has been demonstrated among others 
by Terakura et al. [198], who calculated the non-local site-dependent suscepti- 
bility of several paramagnetic transition metals. A corresponding relativistic 
approach has been worked out by Staunton [199] that has been applied to pure 
transition metals [200,201] with fee and hep structure, respectively. The first step 
of this approach is to use the first order-approximation to the Dyson equation 
to represent the Green’s function of the investigated system in the presence 
of an external magnetic field Bext in terms of the Green’s function G of the 
unperturbed system: 

G^{r,r',E) = G{r,r',E) + 

I d^-"G{r,r",E)nlt\{r")G{r",r',E) . 



(92) 
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Within the framework of SDFT the perturbing Hamiltonian is given 

by [199-201]: 

nlt\ = t^zpMBSext - (l4 - Ki)] . (93) 

Here the second term represents a feedback of the induced spin-magnetization 
via the modified exchange and correlation potentials that gives rise to the 

Stoner enhancement mechanism [202]. The spin susceptibility Xspin is obtained 
from Eqs. (92) and (93) by calculating the expectation value of the operator 
and eliminating the external field i?ext [199]: 

Xspin = Trace 9 j dE j d\ Pa,G^ {r,r, E) . (94) 

The procedure sketched here is not restricted to calculations of the spin 
susceptibility but has a much broader range of application. First of all one has 
to note that the perturbing Hamiltonian in Eq. (93) represents just the coupling 
of the external magnetic field to the spin of the electrons. In addition, there is 
a coupling to the orbital degree of freedom. Within a non-relativistic treatment 
this gives rise to the diamagnetic Langevin and Landau susceptibilities and the 
paramagnetic VanVleck susceptibility [203,204]. A straightforward way to obtain 
the later contribution in a fully relativistic way is to replace the operator Xpe™ 
in Eq. (93) by: 

= 2pB4Sext (95) 

and replacing by 4 in Eq. (94) giving the VanVleck susceptibility xvv instead 
of Xspin- Here it has to be noted that the standard non-relativistic treatment of 
the spin- and orbital magnetic susceptibility do not lead to any spin-orbital cross 
terms. Within a fully relativistic treatment, on the other hand, the perturbing 
Hamiltonian Hpe™ in Eq. (93) will lead to a non-vanishing orbital susceptibi- 
lity due to the spin-orbit coupling; i.e. replacing by in Eq. (94) for Xspin 
will give an orbital contribution in addition to the pure spin susceptibility Xspin- 
Analogously, calculating the expectation value of with the orbital pertur- 
bation Hamiltonian in Eq. (94) will give rise to a spin contribution in 

addition to the pure orbital VanVleck susceptibility. The remaining contribu- 
tion to the orbital susceptibility - the Langevin and Landau susceptibilities - 
can also be treated in a fully relativistic way by a corresponding extension of 
the non-relativistic theory [203]. 

A quantity that is closely connected to the susceptibility is the Knight shift 
that gives the ratio of the induced hyperfine field Hhf seen by a nucleus and the 
external inducing magnetic field. Using the linear response formalism sketched 
above one has [25,205]: 

K = --—4— Trim [ dE [ Hhf.z(r) G(r, r, E) (96) 

7T ft7„i?ext J J 

where T^hf.z is the z-part of the relativistic hyperfine interaction Hamiltonian (see 
Eq. (88)). Making use of the Gordon decomposition of the electronic current 
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density, K can be decomposed into the conventional Fermi-contact, the spin 
dipolar and the orbital part. 

So far, there are only very few theoretical investigations on the Knight shift 
in transition metal systems that can be found in the literature. Very similar to 
the situation for the hyperfine field of spontaneously magnetized solids nearly 
all of these considered only the Fermi contact interaction due to the s-electrons. 
Using a non-relativistic version of the linear response formalism presented above, 
the first calculation of all contributions to the Knight shift has been done for 
the transition metals V, Cr, Xb and Mo [206]. For these metals the VanVleck 
contribution to the magnetic susceptibility and to the Knight shift was found to 
be of the same order of magnitude as the various spin contributions and to stem 
nearly exclusively from the d-electrons. Concerning the magnetic susceptibility 
similar results have been obtained by Yasui and Shimizu using a non-relativistic 
[207] as well as fully relativistic [208] approach. 

Results obtained using the above relativistic linear response formalism are 
given in Fig. 9. The left panel of this figure shows the VanVleck susceptibility 
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Fig. 9. VanVleck susceptibility yvv (left), expectation value of r ® (left) and Van- 
Vleck contribution to the Knight shift Kvv (right) for the d-valence band electrons of 
the pure 4d-transition metal elements. The estimation for idw based on the approxi- 
mation proposed by Abragam and Pryce (AP) as well as the total valence contribution 
Aval due to the orbital perturbation term Apert in Eq. (95) has been added (right). 



Xvv of the pure 4d-transition metals. As found within earlier studies a maximum 
is present for xvv roughly in the middle of the row. This can be explained by 
using a simplified expression for xvv [209] and the fact that here the product 
rio ■ Up of the number of occupied (uq) and unoccupied (up) d-states is at its 
maximum. From the relationship of the orbital parts of the magnetic moment 
and the hyperfine field (see Eq. (3.3)) one can expect an analogous relationship 
for the VanVleck contributions to the susceptibility and Knight shift: 

Kvv,i = 2 /xb (r“^) Xvv.i {l=P,d). 



(97) 
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As one can see in Fig. 9 (left) the expectation value (r“^)d for the 4d-clcmcnts 
increases rapidly along the 4d row. The reason for this is that the correspon- 
ding d-like wave function gets more and more localized with increasing atomic 
number. Combining the results for xvv and {r^^)d using Eq. (97) leads to an 
estimate for the VanVleck Knight shift A'vv that is shown in Fig. 9 (right). Due 
to the variation of {r~^)d the maximum in Kyy is obviously shifted to the right 
compared with the xvv curve. Fig. 9 shows in addition the shift Kyy that has 
been obtained by the full formalism; i.e. Eqs. (89), (92), (95), and (96). As one 
can see the estimation of Kyy using Xvv and (r^^)d reproduces the variation 
with atomic number quite well. However, in line with previous non- relativistic 
results [206], the absolute values differ by up to about 20%. This means that 
estimations based on Eq. (97) are in general less reliable than spin-orbit induced 
orbital hyperfine helds estimated using Eq. (3.3). 

Finally, it should be emphasized that the VanVleck Knight shift Kyy given 
in Fig. 9 stems from the coupling of the external magnetic field to the orbital 
degree of freedom (see Eq. (95)). Because of the use of the relativistic orbital 
hyperfine interaction operator (see Eq. (89)) it is by definition of pure orbital 
nature. If the full hyperfine interaction operator (Eq. (88)) is used instead, the 
Knight shift denoted ATvai in Fig. 9 is obtained. The small difference between 
Kyy and K^ai is of pure spin nature. Within a non-relativistic formalism this 
cross contribution cannot be accounted for, because it is a consequence of spin- 
orbit coupling. 

This section is somewhat outside the main issue of this review, because it 
presents results for paramagnetic solids. However, one should notice that the 
formalism presented above can be applied without modifications to deal with the 
high-held susceptibility of spontaneously magnetically ordered solids, as well as 
the Knight shift in such systems. Furthermore, one can apply the linear response 
formalism also to deal with magneto-crystalline anisotropy or the various kinds 
of spin-spin coupling constants [168,210]. 



Transport properties A further interesting application of the linear response 
formalism is the treatment of galvano-magnetic properties of disordered alloys. 
Here the Knbo-Greenwood formalism allows one to express the elements of the 
electrical conductivity tensor cr in terms of a current-current correlation function 
[ 211 , 212 ]: 



(98) 

conf . 

where is the p-th spatial component of the electronic current density operator 
j = ecoL. In the following it is assumed that a hnite conductivity or resistivity, 
respectively, of the investigated system stems exclusively from chemical disorder, 
i.e. contributions caused by lattice imperfections, grain boundaries, phonons, 
magnons, and so on are ignored. This implies in particular that one is dealing 
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with the residual resistivity and that (■■•)(,onf denotes the atomic 

configuration average for a disordered alloy. 

For a paramagnetic cubic solid the conductivity tensor that results from Eq. 
(98) is diagonal with all elements identical, i.e. the conductivity is isotropic. 
For a ferromagnetic cubic solid, however, this is not the case and the form of 
the conductivity tensor depends on the direction of the magnetization reflecting 
the lowered symmetry of the system. A very general procedure to work out the 
corresponding symmetry properties of response functions has been developed by 
Kleiner [213] (for a somewhat alternative approach, see for example Ref. [214]). 
For example, for a cubic solid with the magnetization along the z-axis one finds 
that way the form [215]: 

/ Pi -pH 0 \ 
p = = PH p_L 0 

Vo 0 Pii/ 

Here p is the resistivity tensor with pi and p|| the transverse and longitudinal 
resistivities, respectively, and pn the spontaneous or anomalous Hall resistivity. 
In addition one dehnes the spontaneous magnetoresistance anisotropy (SMA) or 
anomalous magnctorcsistancc (AMR) ratio by [215]: 

P||(.^ext) P-L{^ ext) 

P p(-^ext) 

where p = |(2pi + p||) is the isotropic resistivity. Here the notation empha- 
sizes that experimentally the SMA is determined by measuring p||(ilext) and 
px(i?ext) as a function of an applied external magnetic field Sext and extrapola- 
ting to Hext = 0. The reason for this is that in contrast to investigations on the 
conventional magnetoresistance zip/p = (p(Hext) — p(0))/p(0) of paramagnetic 
solids, the external magnetic field is used here only to align the magnetization 
of a sample along a certain direction. 

The CPA formalism directly gives access to the configurationally averaged 
Green’s function of disordered alloys. However, when calculating response func- 
tions for disordered alloys - as for example the conduetivity given in in Eq. (98) 
- one has to deal with the configurational average of the product of two Green’s 
functions. Within the framework of the KKR-formalism, this problem has first 
been investigated by Staunton [199] based on the work of Durham et al. [216]. 
With respect to the conductivity tensor a corresponding expression that is con- 
sistent with the single-site CPA-formalism has been worked out in great detail 
by Butler [212]. This approach, derived originally for the non-relativistic case, 
can straightforwardly be applied also for the spin polarized relativistic case [44] 
leading - in contrast to a non-relativistic scheme - in particular to an anisotro- 
pic conductivity tensor (sec for example Eq. (99)). This makes clear that the 
galvano-magnetic effects AHR and SMA are all spin-orbit induced phenomena. 

As an example of an application of the formalism sketched here the calculated 
isotropic resistivities p for the alloy systems Coa,Pdi-a; and CoajPti-a, are shown 
in the left part of Fig. 10 [160] together with corresponding experimental data 
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measured at low temperature. As one can sec, the agreement between calculated 
and measured resistivities is very good for Coa,Pdi_a;. The maximum value of 
the resistivity in this system (16/rl7-cm) as well as the composition for which the 
maximum occurs (about 20% Co) are well reproduced by the calculations. 





Fig. 10. Left: residual isotropic resistivity p of disordered CoajPdi-a, (•) and CoaiPti-a, 
(o) alloys. Full lines: calculated including vertex corrections, broken lines: calcula- 
ted omitting vertex corrections. Right: calculated spontaneous magnetoresistance ani- 
sotropy (SMA) ratio Ap/p of Coi,Pdi_a; (•) and CoajPti-a, (o) alloys. Experimental 
data presented by open squares, diamonds, triangles and crosses stem from various 
sources (see Ref. [160]) 



Using Butler’s approach in dealing with Eq. (98) one accounts for the so- 
called vertex corrections within the framework of the CPA. For Coa,Pdi_a; it 
was found that their contribution increases from about 2% for 5 at.% Co to 
about 25% for 80 at.% Co. 

For the system Coa,Pti_a; the calculated resistivities are much higher than for 
Coa;Pdi_a;, reaching almost 40 p,l?-cm for 30 at% Co. This agrees in a satisfactory 
way with the experimental maximum of about 35 /il7-cm at that composition. 
In contrast to Coa,Pdi_a; the vertex corrections are quite small for Coa;Pti_a;, 
contributing less than 3% to the total conductivity over the entire composition 
range. Previous investigations on paramagnetic alloy systems [217], lead to the 
conclusion that the vertex corrections are the more important the lower the d- 
like DOS at the Fermi level is. For Cua;Pti_a; [217], for example, this applies to 
the noble metal rich side of this system. For ferromagnetic systems, on the other 
hand, the vertex corrections seem to be more important, if the d-like DOS at 
the Fermi level is low at least for one spin subsystem. For this reason, they are 
more pronounced for Coa,Pdi_a; compared to Coa;Pti_a; and more important on 
the Co-rich side of both systems. 

The anisotropy ratios (SMA) calculated from the transverse and longitudinal 
resistivities are shown in the right part of Fig. 10 for the two alloy systems 
Coa;Pdi_a; and Coa,Pti_a;. Experimental values for both systems are included 
for comparison. Coa,Pdi_a; shows remarkably high SMA values of more than 6% 
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for concentrations higher than 20 at.% Co [218,219], The calculations reproduce 
the increase of the experimental data at low Co concentrations very well. For 
higher Co concentrations the calculated values are slightly too low. Note that 
the SMA in Coa,Pdi_a; is still as large as 1.5% even for very low Co contents 
[219,220] which was attributed to local orbital moments on the magnetic sites 
in Ref. [219]. In contrast to CoajPdi^a, the SMA for CoajPti^a, was found to be 
below 1% throughout the whole concentration range [221,222]. These findings 
are perfectly reproduced by the relativistic calculation which reflects the slowly 
varying SMA in Coa,Pti_a;. 

It was realized already years ago that the SMA and also the AHR are cau- 
sed by the spin-orbit coupling. Nevertheless, for the discussion of experimental 
data corresponding phenomenological descriptions had to be used in the past. 
These approaches were based on Mott’s two-current model that ascribe to each 
spin subsystem an independent current contribution and introduced a number of 
model parameters. The SPR-KKR-CPA formalism, on the other hand, does not 
rely on Mott’s two-current model and allows for a parameter-free and quantita- 
tive investigation of galvano-magnetic properties. By manipulating the strength 
of the spin-orbit coupling it was possible in particular to demonstrate explicitly 
the dependency of the SMA and the AHR on the spin-orbit coupling [223]. In 
addition it could be shown that even the isotropic resistivity p can be strongly 
influenced by the spin-orbit coupling, as it has been expected before [159]. 

Further insight into mechanisms giving rise to galvano-magnetic effects can 
be obtained by decomposition of the spin-orbit coupling. To demonstrate this, 
corresponding results for p and SMA ratio Apjp are given in Fig. 11. The left 
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Fig. 11. Isotropic residual resistivity p (left), and spontaneous magnetoresistance 
anisotropy ratio Apjp (right) of disordered bcc-Fea,Coi_x alloys calculated in four 
different ways. The results obtained using the full spin-orbit coupling - indicated by ^ 
- are represented by full circles. Open triangles give the results obtained keeping the 
xy-part ^xy and zz-part respectively, of the spin-orbit coupling. Full squares give 
the result with the spin-orbit coupling completely suppressed (4 = 0). 



part of this figure shows the isotropic residual resistivity p of bcc-Fea;Coi_a; ob- 
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tained from calculations using the full spin-orbit coupling (^). As one notes, the 
variation of p with composition is strongly asymmetric. This corresponds to the 
experimental findings [224,225]. The deviation from a parabolic shape can be 
qualitatively explained by the change of the DOS at the Fermi energy n(i?F) 
that decreases monotonously with increasing Fe content. Keeping only the spin 
mixing part ^xy of the spin-orbit coupling (see Eq. (66)), one finds that p hardly 
changes. This already indicates that the spin mixing is the primary source for the 
relativistic enhancement of p. This spin mixing or hybridization has already be 
demonstrated by means of the spin-projected Bloch spectral function E) 

in Fig. 2 showing for fcc-Fco. 2 Nio .8 that there is an appreciable minority spin 
character admixed to the majority spin states which form a T-centered sheet 
of the Fermi surface and which primarily carry the electronic current [159,157]. 
Admixture of minority spin character clearly opens for these states a new scatte- 
ring channel that is very effective because of the high DOS (Ep) at the Fermi 
energy with minority spin character. As a consequence, the total resistivity has 
to go up remarkably compared to a calculation based on the two-current model 
[161]. This interpretation is confirmed by the results obtained by keeping just 
^zz] i-e. suppressing the spin mixing effect of the spin-orbit coupling. In Fig. 11 
one can sec that this manipulation leads to a strong reduction of the total re- 
sistivity throughout the whole range of concentration. To demonstrate that the 
remaining part ^zz of the spin-orbit coupling has practically no influence on p, an 
additional calculation has been carried out with the spin-orbit coupling comple- 
tely suppressed (^ = 0). The corresponding results nearly completely coincide 
with the ^zz-data confirming this expectation. Here one should note that the 
latter calculational mode (i^ = 0) ^ although technically somewhat different - 
corresponds essentially to a calculation on the basis of the two-current model, 
where the electronic structure is calculated in a scalar relativistic way; i.e. with 
the relativistic corrections Darwin- and mass-velocity-terms taken into account 
[223]. 

For the spin-orbit induced SMA ratio the results obtained by the various 
calculations are given in the right part of Fig. 11. Here one finds that keeping 
only (^xy slightly reduces Ap/p . This means that in contrast to p, ^zz has some 
small effect on this quantity. Nevertheless, one finds that keeping ,^zz alone brings 
Ap! p essentially to zero. From this result it can be concluded that the part ^zz 
of the spin-orbit coupling can in general be neglected as a source for the SMA 
compared to Finally, setting ^ = 0 of course reduces Ap/p exactly to zero 
[223]. 

The model calculations performed for the residual resistivity tensor elements 
of Fea;Coi_a; allow to check the above mentioned phenomenological models for 
the galvano-magnetic effects. For example, Smit ascribed the occurrence of the 
SMA to the spin hybridization caused by the spin-orbit coupling [226]. From 
an analysis of experimental data, on the basis of corresponding expressions for 
Ap/p, .Jaoul et al. concluded that there should be an additional contribution due 
to the spin-diagonal part of the spin-orbit coupling [227]. The results presented 
in Fig. 11 clearly demonstrate that the mechanism discussed by Jaoul et al. can 
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be neglected for the isotropic resistivity p and has only minor contribution to 
the SMA in the case of the alloy system bcc-Fea;Coi_a;. 

Finally, it should mentioned that the expression for the conductivity tensor 
cr given in Eq. (98) has been generalized recently by Butler et al. [228] to deal 
with the giant magnetoresistance (GMR) of multilayer systems. A corresponding 
spin polarized relativistic formulation has been given by Weinberger et al. [229] 
that includes in particular the influence of the spin-orbit coupling. An extension 
of Eq. (98) to finite frequencies uj is also straightforward leading to absorptive 
part of the optical conductivity tensor cr(w). This has been done recently by 
Banhart for the visible regime of the light and for paramagnetic alloy systems 
[230]. For ferromagnetic systems this extension gives directly access to the spin- 
orbit induced magneto-optical Kerr effect (MOKE) that has been investigated so 
far exclusively by means of conventional fc-space band structure methods as for 
example the SPR-LMTO [42]. The only exception to this is the work Huhne et 
al. [231], who developed a very general expression for a (w) within the framework 
of SPR-KKR. This approach should in particular supply a sound basis for an 
investigation of the oscillations of the Kerr-rotation observed for layered surface 
systems like Au/Fe/Au [232-234]. 

3.5 Spectroscopy 

Magnetic Circular Dichroism in X-ray Absorption Magneto-optical ef- 
fects in the visible regime of light arc known now for more than 100 years [41,235] 
and it was realized more than 60 years ago that spin-orbit coupling plays a cen- 
tral role for these [236]. Guided by their experience with the magneto-optical 
Kerr effect (MOKE) Erskine and Stern [237] suggested that there should be a 
corresponding magnetic dichroism in X-ray absorption when circularly polari- 
zed radiation is used. This magnetic circular X-ray dichroism (MCXD) could 
be demonstrated for the first time for transition metals by Schiitz et al. [43] 
by measurements at the K-edge of Fe in bcc-Fe in the XANES-region. Later on 
these authors could also observe the magnetic dichroism in the EXAFS region by 
investigation on the L 2 , 3 -edgc spectra of Gd in hep-Gd [238]. Motivated by the 
MCXD-measurements on bcc-Fc a corresponding fully relativistic description 
has been developed that is based on the SPR-KKR-formalism [84,85] and that 
has been applied since then to a great variety of different systems [42] . Recently, 
this approach was extended to deal with magnetic EXAFS (MEXAFS) by ma- 
king use of the cluster approximation for the multiple scattering representation 
of the final states. 

Using the SPR-KKR-formalism the X-ray absorption coefficient is 

given by [42]: 

oc 9 ^ ^ + hio)T27, (Ei + kuj)Mf^{Ei + hw) 

iocc \_AA' 

A 
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Here the sum i runs over all involved core states with energy Ei and wave 
function The electron-photon interaction operator Xg\, occurring in the 
matrix elements carries in particular information on the wave vector q 

of the radiation and on its polarization A. The last term in Eq. (101) is 
an atomic-like matrix element [42] and is connected to the term in the Green’s 
function involving the irregular solution to the Dirac equation (see Eq. (71)). 
Accordingly, it contributes only when working with complex energies. 

Although there are also various forms of linear magnetic dichroism, most ex- 
perimental investigations on the magnetic dichroism in X-ray absorption spec- 
troscopy use circularly polarized radiation because the circular dichroism is most 
pronounced. To allow for a sound interpretation of the corresponding dichroic 
signal All = , given by the difference in absorption of left and right 

circularly polarized radiation, a set of so-called sum rules have been derived by 
several authors [239-242]. The main virtue of these rules is that they should 
allow one to obtain a reasonable estimate for expectation values (az) and (Iz) 
of an absorber atom from its energy integrated dichroic signals J AiJ,[E)dE. Of 
course, this is a very appealing property because these quantities are directly 
proportional to the spin and orbital magnetic moments, yUspin and yUorb, respec- 
tively. However, in applying the sum rules one of the main problems is to fix the 
upper energy integration limit. For that reason it has been suggested to apply 
the sum rules in their differential form and to discuss the dichroic spectra Ajj,{E) 
directly. For the L 2 , 3 -edges these differential sum rules are given by [77]: 

3 [Zi^La - (102) 

2 [4i/iL3 + = Cd — (lz)d ■ (103) 

Here Cd is a normalization constant and Tz is the magnetic dipole operator, that 
often can be ignored. Thus, the basic information to be deduced from the dichroic 
signal are the spin- and orbital polarization, ;jg(CTz)d and -^{lz)d, respectively, 
of final states with d-character. 

The magnetic dichroism of the L 2 , 3 -cdgc spectra of Pt in disordered Fea;Pti_a; 
alloys has been studied experimentally as well as theoretically in great detail in 
the past [243-247]. Typically for Pt L 2 , 3 -spectra it was found that the white 
lines at the L 2 - and L 3 -edges are quite different because of the influence of the 
spin-orbit coupling acting on the final states. This finding makes clear that a 
fully relativistic approach is indispensable to achieve a quantitative description 
of the L 2 , 3 -absorption spectra of Pt. This applies in particular if one is dealing 
with magnetic EXAFS (MEXAFS). To deal with the MCXD in the X-ray ab- 
sorption of Pt in ordered FesPt the scattering path operator entering the 
expression for the absorption coefficient was calculated using the matrix 
inversion technique for a cluster of 135 atoms in the XANES and 55 atoms in 
the EXAFS region, respectively, including the central absorber site. The effects 
of self-energy corrections [141,248] have been accounted for after calculating the 
spectra. 
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The top panel of Fig. 12 shows the results of calculations for the L 2 -cdge 
EXAFS-spectra of Pt in ordered FeaPt. Corresponding experimental data, ob- 
tained by Ahlers and coworkers [249] for an ordered but slightly off-stoichiometric 
sample, are added. As one can see, the agreement of the theoretical and experi- 
mental spectra is quite satisfying, demonstrating in particular that the experi- 
mental sample is indeed ordered [250]. 

The circular dichroic spectrum for the L 2 -edge is shown in the bottom 

panel of Fig. 12. Again a very satisfying agreement with the corresponding expe- 
rimental results could be achieved. The results for A/xlj clearly demonstrate that 
the occurrence of magnetic dichroism is by no means restricted to the white line 
region. Although the amplitude for Ap-La is quite small compared to the white 
line region, it is present throughout the whole EXAFS-range. 

As mentioned above, the applicability of the sum rules in their conventional 
form seems to be somewhat doubtful because of these findings. Nevertheless a 
clear-cut interpretation of the MEXAFS-spectra can be given making use of the 
sum rules in their differential form. In the upper part of Fig. 13 a superposi- 
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Fig. 12. EXAFS- (top) and MEXAFS-spectra (bottom) at the L 2 -edge of Pt in FesPt. 
Calculations for the ordered compound (full line), compared against the experimental 
data for the Feo. 72 Pto .28 (dotted line) [249]. 



tion of the theoretical magnetic dichroism spectra Apu 2 and Apua according to 
Eq. (102) is given (here the very small contribution ■^{Tz)d has been neglected). 
This is compared to the spin polarization of the d-states that have been 
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obtained directly from the band structure calculations. In the lower part of Fig. 
13 the superposition according to Eq. (103) of the dichroic spectra is compared 
with the directly calculated orbital polarization ^(4)d of the d-states. To com- 
pare the spectroscopic data with the band structure results the normalization 
factor Cd in Eq. (102) and (103) has been used as a free scaling parameter using 
the same value for the upper and lower part of Fig. 13. 
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Fig. 13. Top: Spin polarization ;^(<Tz)d for the d-states of Pt (full line) compared to 
those derived from the MCXD-spectra using Eq. (102) (dashed line). Bottom: Orbital 
polarization ^(/z)d for the d-states of Pt (full line) compared to those derived from 
the MCXD-spectra using Eq. (103) (dashed line). 



The nearly perfect coincidence of the various curves in the upper and lower 
part of Fig. 13 convincingly demonstrates that the primary information that 
can be deduced from circular L 2 , 3 -MEXAFS spectra is the spin and orbital 
polarization for the hiial d-like states of the absorber atom. Of course, these arc 
no pure atomic-like properties but concerning their variation with energy they 
strongly depend on the bonding to their surrounding. For this reason it is quite 
reasonable to perform a Fourier transform to the MEXAFS-spectra to seek for 
information on the magnetization distribution around the absorber atom [249]. 
However, the relationship of the magnetic radial distribution is by no means 
trivial. Nevertheless, it seems to be worth to investigate this relationship in more 
detail to be able to deduce further magnetic information from MEXAFS-spectra 
in a sound and reliable way. 
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Magnetic Dichroism in the VB-XPS Nowadays, the standard approach 
to deal with angular-resolved photoemission is to use the one-step model [251] 
together with the layer-KKR formalism [148]. A corresponding relativistic ver- 
sion for paramagnetic solids of this approach has been developed by Ackermann 
and Feder [252] and Ginatempo et al. [253] to study spin-orbit induced pola- 
rization effects in photoemission. In addition, a spin polarized relativistic and 
full-potential version has been introduced recently by Feder et al. [90] and Flucht- 
mann et al. [149] to deal with magnetic dichroic effects in magnetic solids. As 
an alternative to the above mentioned layer-KKR formalism, one can use the 
real-space multiple scattering formalism as well [254]. A corresponding expres- 
sion for the time-reversed LEED-state used to represent the hnal states of a 
photo-emission experiment has been worked out first by Durham [255]. Eor the 
spin polarized relativistic case the LEED-state is given by [256]: 

C-fK, E) = 4^/^^ E Yr^-'\k) (104) 

A 

m A' 

where k and denote the wave vector and spin character of the photo-electron. 
With the LEED-state available, it is straightforward to derive expressions for the 
photo-electron current intensity I{E,ms;uj,q,X) for any kind of photoemission 
experiment making use of Fermi’s golden rule [251,256]. For excitation with high 
energy photons, multiple scattering may be neglected for the final states, leading 
to the so-called single scatterer approximation [257]. Dealing in addition with 
angle- integrated spectra leads to a rather simple expression for the photo-current 
intensity. In the case of VB-XPS one finds in particular [89] : 

7(A,m,;u;,q,A)ocE^a3 E (105) 
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Dealing with a paramagnetic solid this expression can be further simplihed lea- 
ding to the familiar result that the VB-XPS intensity is given by the sum over 
the K- or angular-momentum j resolved DOS rii^ of the occupied part of the 
valence band with each contribution weighted by an appropriate cross-.section 
a, [258]. 

The spin polarized relativistic approach presented above has been applied 
among others to calculate the VB-XPS spectra of the disordered alloy system 
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Coa;Pti_a; [89]. Theoretical spectra for various concentrations arc given in Fig. 
14 for unpolarized radiation with fko = 1253.6 eV. As can be seen, these spectra 
agree very well with corresponding experimental data reported by Weller and 
Reim [259]. 

As mentioned above, for the paramagnetic case the expression for the photo 
current in Eq. (105) can be simplified to a concentration weighted sum over 
the products of the K-resolved partial DOS n^{E) and a corresponding matrix 
element that smoothly varies with energy [258]. This means that the XPS-spectra 
map the DOS-curves in a rather direct way. This essentially holds also for the 
spin-polarized case. As can be seen from the DOS curves of Coa,Pti_a; (see for 
example Ref. [89]) both components retain the gross features of their electronic 
structure when they are combined to an alloy system: while Co has a narrow and 
strongly exchange-split d-band complex close to the Fermi level, the Pt d-band is 
rather broad and only slightly spin-split due to hybridization in the region of the 
Co d-band. This behavior supplies a very simple explanation for the variation 
of the spectra in Fig. 14 with composition and allows one to ascribe prominent 
features as shoulders and peaks to either Co or Pt. Nevertheless, one has to 
note that these spectra are not just a concentration weighted sum of the spectra 
for pure Co and pure Pt. One reason for this is that there are non- negligible 
changes of the component-resolved DOS-curves compared to those of the pure 
constituents. In addition one can see from Fig. 14 that for the selected photon 
energy the partial photoemission absorption cross section that depends on the 
initial state energy E as well as the photon energy hio is about a factor of two 
higher for Pt than for Co. 

Performing angular-resolved photoemission experiments for disordered alloys 
seems to be profitable only in some favorable cases for which disorder does not 
prevent the existence of a rather well-defined dispersion relation [260]. While 
this applies for example to Fea;Nii_a; (see Fig. 2) this is surely not the case 
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Fig. 14. Theoretical VB-XPS spectra (left) for Co^iPti-^! for unpolarized radiation 
and photon energy hw = 1253.6 eV. The corresponding experimental data (right) have 
been taken from Weller and Reim [259]. 
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for Coa;Pti_a;. Nevertheless, deeper insight into its electronic structure and in 
particular its magnetic aspects can be obtained by VB-XPS experiments done 
using circularly polarized radiation. In the case of Coa;Pti_a; [89] but also for 
Fea;Coi-a; [261] the calculation of the VB-XPS spectra predict a non-negligible 
circular dichroism even for the angular averaged spectra that should be strong 
enough to be detected within an experiment (corresponding measurements have 
not yet been done). The circular dichroism occurs primarily in the region of 
pronounced spin-polarization and has the same sign throughout that energy 
region. This behavior differs from that of the angular resolved theoretical spectra 
obtained by Scheunomann ct al. [90] for the normal emission from perpendicular 
magnetized Xi (001) that show a circular dichroism that change sign with binding 
energy with the energy integrated dichroism spectrum essentially vanishing [90] . 
While these specific spectra map states well below the Fermi energy with both 
spin characters contributing roughly to the same extent, the angular integrated 
spectra are dominated by the majority spin character [89]. Therefore, shifting 
the Fermi energy artificially to higher energies should result in a reversal in sign 
for the dichroism in the VB-XPS spectrum with the energy integrated signal 
going to zero. 

A more pronounced circular dichroism than for the spin-averaged case is fo- 
und for the spin-polarized case, with corresponding spectra given in Fig. 15. 
Again the circular dichroism occurs primarily in the energy region showing pro- 
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Fig. 15. Theoretical spin-resolved VB-XPS spectra of Coo.ePto.r for circularly pola- 
rized radiation and photon energy huj = 1253.6 eV. 



nounced spin-polarization. Comparing the circular dichroism signal for the two 
different spin characters one notes that they are opposite in sign. This behavior 
has also been found before for the spin-resolved spectra mentioned above and 
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explained as a direct consequence of the spin orbit coupling [90]. Finally it should 
be mentioned that circular dichroism spectra similar to that shown in Fig. 15 
are found if the photon energy feu is varied. Setting feu to a value for which 
one of the components possess a Cooper minimum allows one to get additional 
component specific information. This could supply very helpful information on 
the nature of the initial states and the contribution of the various components 
for the very complex magneto-optical Kerr-rotation spectra of transition metal 
alloys and compounds [262]. 

4 Summary 

The formal background to deal with the electronic structure of magnetic solids, 
accounting simultaneously for relativistic effects, has been reviewed. The main 
emphasize has been laid on a four-component formalism that is based on a Dirac 
equation set up within the framework of DFT. As recent developments in this 
field, the inclusion of the Breit-interaction and the use of CDFT has been discus- 
sed. Solving the corresponding single site problem allows in principle to set up for 
any band structure method the necessary four-component spherical basis func- 
tions that account on a corresponding level of sophistication simultaneously and 
on the same footing for magnetic order and relativity. Here, the KKR-method 
has been used and accordingly several recent developments concerning this - as 
the incorporation of the OP-formalism, the full-potential formalism or the TB- 
version - have been discussed or mentioned. The main motivation for using the 
KKR-method stems from the fact that all features of the underlying Hamilto- 
nian are directly incorporated in the corresponding wave functions, in contrast 
to schemes that account for corrections to an unperturbed Hamiltonian within 
a variational procedure. In addition, it directly gives access to the electronic 
Green’s function. This very appealing feature has been exploited within many 
of the presented applications, as for example in the case of impurity systems 
lacking Bloch-symmetry. 

Independent of the use of the KKR-method, the presented applications de- 
monstrated that the spin-orbit coupling influences the properties of magnetic 
solids in various, quite different, ways. In some cases one gets just a correction 
to results that are obtained by a non- or scalar relativistic calculation. Of course, 
most interesting are pure spin-orbit induced properties that would not occur if 
the spin-orbit coupling or magnetic order would be absent. Nearly all properties 
discussed here fall into this category. 

Using the full Dirac-formalism to deal with magnetic systems is of course 
very satisfying from a formal point of view but makes it in general quite cum- 
bersome to give a simple interpretation of the numerical results. For this reason, 
approximate schemes, as those presented here, are very helpful and important 
for the discussion of the various spin-orbit induced properties. In particular, they 
allow to decide which consequence of the spin-orbit coupling is more important: 
lifting of energetic degeneracies or spin-hybridization. Concerning this question. 
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it was demonstrated that the relevance of this two effects can be quite different 
for a spin-orbit induced property. 

Altogether, one can claim that in general a rather satisfying quantitative 
agreement of theoretical and experimental data can be achieved when calcula- 
ting spin-orbit induced properties. Remarkable deviations occur primarily if one 
is investigating properties closely connected with orbital magnetism. The most 
promising way to remove these problems in a consistent way, i.e. in particu- 
lar within the framework of DFT, is to apply CDFT. Because of the delicate 
situation it might be necessary to perform the corresponding calculations in 
the full-potential mode and/or to use the optimized effective potential (OEP) 
scheme in addition. Anyhow, whatever will happen in this and related fields in 
the future, working with the four-component Dirac-Green’s function-formalism 
will guaranty that one is open and prepared for any new developments. 
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Abstract. A review of a state of the art, theoretical method for calculating the ma- 
gneto crystalline anisotropy (MAE) is given. The fundamentals of first principles, total 
energy calculations, i.e. density functional theory, are described. In addition one of 
the most common methods for solving the Kohn-Sham equation, the linear muffin-tin 
orbital method, is described briefly. Technical aspects and difficulties for performing 
theoretical studies of the MAE are discussed and several examples are given. It is poin- 
ted out that the orbital magnetism and the MAE often are intimately connected. The 
MAE is also argued to be connected to other details of the electronic structure, such 
as the values of the density of states (DOS) at the Fermi level, the partitioning of 
the DOS into crystal field components and the hybridization with orbitals of possible 
ligand atoms. 



1 Introduction 

Today we are witnessing how modern electronic structure theory has become a 
very useful theoretical tool in explaining and complementing experiments. Often 
accurate calculations can reproduce observed magnetic properties such as the oc- 
CTirrence of ferro-, ferri-, antiferro- and paramagnetism. Also, the direction and 
magnitude of the magnetic moments of bulk, surface and inter-phase systems 
are normally described well, as is the preferred easy axis of the magnetization 
direction, something which is called the magneto-crystalline anisotropy-MAE. 
In addition it has become possible to reproduce the finding that magnetic mo- 
ments do not always point in the same direction, as is the case in non-collinear 
magnets and systems with spin spirals [1]. Parallel to this development, theories 
of micro-magnetism [2] have advanced attempting to describe processes such as 
magnetization reversal and nucleation when the direction of an external held 
is changed. These theories describe ‘realistic’ magnetization phenomena, such 
as domain formation, domain walls (Bloch and Neel walls), magnetization ro- 
tation and curling, etc., which may not always be accessible directly to hrst 
principles theorists. However, one important parameter in these theories is the 
magneto-crystalline anisotropy (MAE). Information of this physical property is 
often given in terms of the so called anisotropy constants (e.g. Ki and K 2 ). 
Apart from being an important parameter in micro-magnetic models, a deeper 
insight about the MAE, which shows the directions in the crystal which are easy 
and hard magnetization directions, is important knowledge in itself. This is not 
least due to the possibility of designing new materials which are optimal for 
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information storage. Let us illustrate this statement with an example. A small 
fraction of a magnetic disc with the magnetic moment pointing in one direction 
may be thought of as representing a 0 in a binary code whereas another section 
with a reversed moment direction would represent a 1. It becomes important to 
have the directions well defined, and as a consequence materials with an optimal 
MAE are searched for. Recently a lot of attention has been focused on orbi- 
tal magnetism, (Lz), of itinerant electron systems, especially since it has been 
observed that there may be a connection between orbital magnetism and the 
magneto-crystalline anisotropy (MAE) [3]. As will be argued below, this means 
that one avenue to hnd a material with a large MAE is to search for a large 
orbital moment. At least materials which have a large directional dependence 
(anisotropy) of the orbital moment are expected to have enhanced MAE [3]. 

This chapter deals with theoretical aspects of the MAE of transition metals, 
its relation to important parameters such as the crystal structure, spin and or- 
bital moments, electronic structure and hybridization with possible neighboring 
atoms. Although several examples of bulk, surfaces and interfaces will be given, 
this chapter is by far not complete and should not be viewed as an overview 
of the subject. It is merely intended to illustrate ideas, give understanding and 
to show ways to think about the MAE. In this chapter we shall thus outline 
how one can calculate the orbital moment of an itinerant electron system. We 
also point out that most of the discussion in this chapter deals with elements 
and compounds where the magnetism is provided by itinerant electrons, elec- 
trons which occupy band states that have dispersion (an example of this is bcc 
Fe). This is in contrast to localized electrons which are atomic like, even in the 
crystalline environment (an example is hep Gd, a rare earth element). Calcula- 
tion of the MAE requires resolving the difference in total energy (which often 
is of the order of several thousands of Ry) when the magnetization is pointing 
in two different directions of the solid, with a requirement of accuracy some- 
times better than 1 It is the purpose of this chapter to outline some of 

the techniques used at present and to give a few examples of the theoretical 
modeling of magnetism of solids by means of first principles calculations. This 
chapter begins by reviewing the most important parts of the basis of modern 
electronic structure theory, namely Density Functional Theory (DFT) [4,5] and 
the approximations which make it useftil for calculations, i.e. the Local Density 
Approximation (LDA) and the Generalized Gradient Approximation (GGA). 
The latter two approximations give rise to explicit but approximate expressions 
of the total energy of a material. At this point we would like to draw attention 
to the fact that there exist textbooks dealing with this topic; for a more detailed 
study of DFT, LDA and GGA we refer the reader to Dreitzler and Gross [6], and 
Eschrig [7]. There are also good review articles on this such as Gunnarsson and 
Jones [8], and Callaway and March [9]. We will also describe in some detail one of 
the many methods which exist for calculating the electronic structure and total 
energy of a solid or a surface, a basis function method based on so-called linear 
muffin-tin orbitals (LMTO). There are already detailed descriptions of methods 
for calculating the electronic structure of a solid based on linear muffin-tin or- 
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bitals [10-12], However the method outlined here makes fewer approximations 
and is more accurate. In achieving this there are obstacles which must be over- 
come which have not been described before, we will try to point these out and 
compare with the previous methods [10-12], We would also like to mention that 
detailed descriptions of other computational methods, such as the Augmented 
Plane Wave (APW) method of Slater [13] and the modified, linearized version 
of it (LAPW) by Andersen [10], can be found in Ref. [14] and [15], respectively. 
Another computational method which calculates the LDA or GGA total energy 
is based on the so called Linear Combination of Atomic Orbitals (LG AO), or- 
bitals which may be recognized from text books on chemistry and physics of 
molecules [16], The LCAO method is described in detail in Refs. [17] and [18] 
and in addition calculations of the electronic structure of a variety of elements 
[19] and compounds [20] have been compiled in book form. 

2 Introductory Remarks on Electronic Structure Theory 

All theories for calculating the magnetic properties and the total energy of solids, 
surfaces and interfaces start out by adopting the Born-Oppenheimer approxima- 
tion. This approximation simply neglects the movement of the atomic nuclei and 
the electrons are considered to be moving around in a material where all nuclei 
are at fixed positions. The motivation for this is that the electrons are much 
lighter than the nuclei and thus move much faster. For the materials discussed 
in this chapter, where the lightest element studied belongs to the 3d series, this 
approximation is a very good one. One can now focus solely on the electrons, 
which in itself is a formidable problem. The electrons interact with the positive 
atomic nuclei and with each other electrons via Coulomb forces. Although the 
former interaction is by no means simple it can be treated, whereas the latter 
interaction is impossible to calculate and one must resort to approximations. 

Attempts to estimate the electron-electron interaction in solids in order to 
calculate the electronic dispersion of solids or the total energy of them dates back 
to the days of the Tliomas-Fermi model [21], the Hartree approximation and to 
the X-a method of Slater [13], An extension of these ideas culminated in what 
we today know as Density Functional Theory (DFT) [4,5], As we will see below 
this has made it possible to calculate the total energy of for instance a solid, 
using the electron density, n(r), as the key variable (n(r)=n’^(r)+n-^(r)) where 
n^(r) is the spin up electron density and n'^(r) the spin down electron density). 
For magnetic systems one has also to consider the magnetization density, m(r) 
(m(r)=n^(r)-n'^(r)), and examples will be given in this chapter of calculations 
based on DFT where the total energy and the magnetic properties of a solid are 
successfully reproduced. 
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3 Density Functional Theory 

3.1 The Hohenberg Kohn Theorem 

Let us start the description of the basic ideas of DFT by considering a non- 
magnetic system with spin degeneracy^ , the spin polarized case will be discussed 
later. It is now our purpose to prove the two theorems that are the basis of DFT. 

Theorem 1. The total energy of a system is a unique functional of the ground 
state electron density. 

To demonstrate this we consider a Hamiltonian, H = T + V + W, where T 
represents the kinetic energy of the system, V the interaction of the electrons 
with an external potential (normally this is the potential given by the atomic 
nuclei in the solid) and W the (exact but complex) electron-electron interaction. 
The solution to this Hamiltonian results in a ground state many body wave 
function '?'(ri, r 2 , ....rjv) (for N electrons), and we have 

HT = EgsT. (1) 

The electron density can now be calculated from 

n(r) = J d^r^nj^^ \ <T{ri,r2, ....tn) ^ 5(r - r^) (2) 

and the interaction V is written as V=j n{r)v{r)d^r, where u(r) is the external 

potential. We will now demonstrate that two different external potentials n(r) 
and u'(r) must give rise to different ground state electron densities. To show this 
we note firstly that for the system with potential v'(r) we have 

H'T' = (3) 

From the variational principle it follows that 

Figs = {'T\H\E) < fE' I H I E'). (4) 

By adding and subtracting v'(r) on the rhs. of (4) we obtain 

(E' I HI E') = {E' \H' TV -V \ E') (5) 

= Figs + J n'{r){v{r) — v'{r))d^r. 

Combining the expressions in (4) and (5) gives, 

Egs < £'gs + j n\r){v{r) -v'{v))d^r. (6) 

^ A more elaborate discussion of the contents of this chapter may be found in the text 
book by Dreitzler and Gross [6]. 
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A similar argument starting from the expression 

E'g, = (E' I H' I ^') <{E\H' I E), (7) 

results in 

Ks < + J n{r){v'{r) - v{r))d^r. (8) 

We will now show that if we assume that n'(r) = n(r) an absurd result emerges 
and hence this assumption must be wrong, i.e. n'(r) / n(r). The absurdity comes 
from the fact that if (6) and (8) are added and one assumes rt'(r) = n(r) one 
obtains the equation 



Egs + £^gs < -E^gs + Egs, (9) 

which is clearly wrong. Hence n'(r) ^ n(r) and we conclude that knowledge 
of the electron density, n(r), implies that we know it was calculated from a 
Hamiltonian with an external potential u(r) (since we just showed that u(r) and 
u'(r) give rise to different densities n(r) and n'(r)). As the kinetic energy, T, 
and electron-electron interactions, W, are known and specified we conclude that 
knowledge of the ground state electron density determines the entire Hamiltonian 
and hence the ground state energy, which proves Theorem 1 (although an explicit 
and practical form for calculating Egg from n(r) is not clear from the arguments 
given above) . One can thus express a functional relationship between the ground 
state energy and the corresponding electron density as 



E[n{r)] = r[?T.(r)] + V[n{r)] + W[n{r)]. (10) 

Since the arguments that lead to this relationship do not depend on the form of 
u(r) (and hence is valid for atoms, molecules and solids) the kinetic energy and 
electron-electron interaction T + W = A[n(r)] is called a universal functional of 
the electron density. 

A second important theorem of DFT is 

Theorem 2. The exact ground state density minimizes E[n{r)]. 

This statement partly follows from the fact that the many electron wave 
function is also specified by the electron density [6] , since the ground state density 
specihes the Hamiltonian and hence also the wave function (of the ground state 
and of excited states), and to illustrate this dependence we write tf'[n(r)]. To 
prove Theorem 2 we next note that for a given external potential Uo(r) we can 
from Theorem 1 write 

A„„[n(r)] =< E[n{r)] \ T + W + Vq \ E[n{r)] >, (11) 

where the subscript vq indicates that this is the energy functional for a system 
with external potential uq (r) . If we now denote the ground state electron density 
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by no(r) wc can express the ground state as !^'[no(r)]. From the variational 
principle we again obtain 

(<F[r7,o(r)] \ T + W + Vo\ tf'[no(r)]) < 

(lf'[n(r)] I T + EF + Fo I lf'[n(r)]), (12) 

which can also be expressed as 

E^„[no{r)] < E^„[n{r)], (13) 

i.c., the ground state density minimizes the energy functional E[n{r)], which is 
what Theorem 2 states. If we now had an explicit form for E\n{r)] we could go 
ahead and minimize it with respect to the electron density and in this way calcu- 
late the ground state energy. Unfortunately one must resort to approximations 
to obtain an explicit expression for E[n{r)], due to the complexity provided by 
the electron-electron interactions. We will outline such approximations in the 
next section but before we do this we note that the arguments above can also 
be repeated for spin polarized systems and one may show that the ground state 
energy is a unique functional of the electron and magnetization density. The 
proof of this is quite similar to the proof outlined above, and we start out by 
modifying the Hamiltonian to include an external magnetic field, B(r), so that 
we have H = T + W + U , where U= f u(r)n(r)— B(r)-m(r) d^r. Based on the 
variational principle we may, analogous to the discussion around Eqns.4-6, arrive 
at 



Egs < E'^s + J 


^ n'(r)(u(r) 


1 

CO 

1 


j m'(r)(B(r) 


-B'(r))dV 


(14) 


and 












Egs < Egs + 

J 


^ n(r)(u'(r) 


— v{r))(fr — 


f m(r)(B'(r) 


- B(r))dV. 


(15) 



If we again assume that n(r) = n'(r) and m(r) = m'(r) and add (14) and (15) we 
arrive at the same absurd result as in the discussion of spin degenerate systems, 
i.e. (9), and we must draw the conclusion that n(r) ^ n'(r) and m(r) ^ m'(r). 
Hence we may conclude that for magnetic systems wc can write the ground 
state energy as a unique functional of the electron density and the magnetization 
density. 

3.2 The Kohn— Sham Approach 

The theorems described above are also valid for non-interacting electron systems 
where the part of the Hamiltonian describing electron-electron interactions, VF, 
is absent. In this case electrons which move in the field of an external potential 
which, we for reasons that will be obvious below, call are solutions to a 
one-electron Schrodingcr equation, 

-V^ 

[— ^ 1- Ves]'4)i = Eii>i. 



(16) 




First Principles Theory of Magneto-Crystalline Anisotropy 253 



There is an infinity of solutions to this equation and to specify a special solution 
the subscript i is introduced. From (16) one can calculate an electron density 
from the lowest lying one-particle (op) states. If there are N electron states which 
are solutions to (16) one simply calculates the one-particle (a label introduced 
to show that there are no electron-electron interactions considered) electron 
density from 



N/2 

^op(r) = ^2 I ^i(r) |2, (17) 

where the factor 2 comes from spin degeneracy. In this case the energy functional 
which describes the total energy of the N electrons may be written as, 



-S'op [^op (l*)] = ^opf^op] I^ff['^op] = 'y I 2 I 

+ I ^op (r) 14 ff(r)d^r, 



(18) 



and the electron density which minimizes this functional is obtained from the 
requirement that the energy functional is stationary for small variations of the 
electron density around the ground state density. This can be written as, 



0 SEqp F^op[^op(^) T c^riop(r)] £'Qp[nop(r)], (1^) 

which may also be written as 

0 = (lTop[nop] + J Sn{r)Ves{r)(fr. (20) 

Carrying out the minimization in (20) leads to (16), and we have demonstrated 
that independent particles which are the solution to (16) give rise to a density 
which minimizes the total energy expression of independent particles in (18). The 
reason for introducing (16) to (20) is mainly that they can be solved, at least 
approximately, to within a desired accuracy (this is the topic of the next subs- 
ection), but at this state it is unclear if equations (16) to (20) have anything 
to do with a ‘real’ interacting system. However, as will be clear in a moment 
they can, via the Kohn-Sham approach, be used to actually calculate the ground 
state energy of a ’real’ electron system. The basic principle of the Kohn-Sham 
approach is now to assume that one can find an effective potential, so that 
nop(r)=n(r), where n(r) is the electron density of the fully interacting (‘real’) 
system. Since we know that the total energy of a system is uniquely determined 
by the electron density it seems to be an efficient route to obtain the correct 
electron density from a one-electron like problem, in our ultimate quest of cal- 
culating the ground state energy of a solid. 
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The question now is how to determine so that nop(r) becomes equal to 
n(r). To do this we recast the energy functional in (10) as 



£'[n(r)] = Top[n(r)] + J n{r)v{r)d^r 

2n(r) -n(r') , 3 ^ ,3 , 



r — r' 



drnrr’ + £^xc[?^(r)]. 



(21) 



Since we require that nop(r) should be equal to n(r) we have in the expression 
above, for simplicity, skipped the subscript op on the electron density of the right 
hand side. In (21) we have introduced the one-particle kinetic energy functional 
instead of the true kinetic energy functional of (10) and we have introduced the 
Hartree electrostatic interaction instead of the true electron-electron interac- 
tion. Hence in order to make (21) equal to (10) we must introduce a term that 
corrects for these replacements, and this is what the exchange and correlation 
energy, £'xc[T^(r)], does. Since the first three terms on the right hand side of (21) 
are possible to calculate numerically we have moved the complexity of the fully 
interacting system to Ending the exchange and correlation functional. So far it 
has been impossible to find the exact exchange and correlation functional so that 
(21) holds for all densities and all systems. However, for a uniform electron gas 
one can calculate Exc[n{r)]'^ for all values of the electron density and parame- 
terized forms of iiixc[?^(r)] as a function of n(r) is available. The local density 
approximation assumes that these parameterizations work even in cases where 
the electron gas is not uniform, but varies is space, as it does in a solid, surface 
or interface. The local density approximation introduces the following expression 
for the exchange-correlation energy [6], 



(22) 



where exc[?^(r)] is named the exchange-correlation energy density and in a para- 
meterized form its dependence on n(r) is relatively simple and may, for example, 
be found in Ref. [6]. We arc now armed with an (approximate) expression for the 
ground state energy functional and in analogy with (19) and (20) wc determine 
the ground state density from this functional by requiring that the functional 
(equation (21)) is stationary for small variations of the electron density around 
the ground state density. By doing this we obtain an expression which is quite 
similar to (20), i.e. we obtain 



0 = 6Top{n] + 




^2 ,3 , 9(£xc[n(r)]n(r)) 

I r — r' I dn{r) 



(23) 



^ This can be done in the high electron density limit [22] and in the low electron density 
limit [23] . Interpolation between these two limits gave rise to parameterized forms of 
the exchange and correlation functional of a uniform electron gas for all values of the 
density [23]. However, this interpolation is in modern electronic structure calculations 
replaced by approaches which are based on quantum Monte-Carlo simulations for 
the intermediate values of the electron gas [24] . 




First Principles Theory of Magneto-Crystalline Anisotropy 255 



Wc have now achieved our goal in finding the effective potential which (within 
the approximations and assumptions introduced) ensures that nop(r)=n(r), since 
when we compare (20) and (23) we can identify as, 

V)jfr(r) = u(r) + j ^ ^ d^r' + /Xxc(n(r)), (24) 



where 



Mxc(n(r)) 



a(e,,c[n(r)]n(r)) r . ^ ^(£xc[n(r)]) 



(25) 



All we have to do now is to solve (16) with the effective potential specified by 
(24). Since the effective potential to be used in (16) depends on the electron 
density, the property we want to calculate, one has to perform a self-consistent 
field calculation (SCF) where an initial electron density is more or less guessed 
and an effective potential is calculated from (24). This potential is then used 
to solve (16) and a new electron density is calculated from (17), which is then 
put back into (24). This procedure is repeated until convergence is obtained, i.e. 
until the density does not change appreciably with successive iterations^. Once 
a self-consistent electron density has been found one can calculate the ground 
state energy of the Kohn-Sham (LDA) energy functional (via (21)) and hence 
one of the main goals in electronic structure calculations has been achieved. 

Before entering the details of how one might solve the self consistent Kohn- 
Sham equations, (16) and (17), we note that the entire procedure outlined above 
can also be made to work for magnetic systems. At the end of the previous 
section we concluded that for magnetic systems the ground state energy may be 
written as a unique functional of the electron density and of the magnetization 
density. An alternative way of expressing this is to state that there is an energy 
functional which depends both on the majority and the minority spin density 
(since n(r) = n7(r) + n'^(r) and m(r) = n7(r) — n'^(r)^) and we can write 
iJ[n^(r), n'l'(r)]. We can then make an analogous assumption to the discussion 
around (21) and obtain a Kohn-Sham scheme for spin polarized systems via. 



S[n^(r), n'*'(r)] = Top[n^(r), n'^(r)] + j n{T)v{v)d^r (26) 

+ ^ I I + Axc[n'^(r),n-*'(r)]. 

® Normally one is forced to mix the electron density which is the output of (17) with 
the electron density which is in input for that particular loop in the SCF iteratio- 
nal procedure before one takes this mixed density and puts it in (24). The whole 
procedure of mixing is quite complex where many suggestions of how to achieve self 
consistency with as few iterations as possible have been suggested [25]. 

^ We note here that this approach has simplified the situation somewhat since the 
magnetization density is a scalar property with both magnitude and spin. In this 
analysis we are assuming that the magnetization is pointing only in one direction, 
the z-direction, of the system. 
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In a real solid the preference for occupying one spin channel (to some degree) 
more than the other is traditionally explained as due to the exchange interaction 
and the driving force for it is the electron-electron interaction in the Hamilto- 
nian. Hence in the spin polarized Kohn-Sham scheme this necessarily means 
that the exchange and correlation potential, which is supposed to absorb all 
complex electron-electron interactions, must depend both on the charge and the 
spin (magnetization) density. Turning again to studies on the uniform electron 
density is useful and parameterizations for iHxc[?T'^(r), n'^(r)], as a function of 
n^(r) and n'^(r), have been made. We can now proceed quite analogously to the 
discussion around (16) and (17) and analyze a one particle Hamiltonian with 
spin up (down) effective potentials, 

, (27) 

where the electron density for electrons with a given spin is obtained from, 

= XI I • ( 28 ) 

i=l 

Repeating the discussion which led to (24), with the only modification that we 
now require the energy functional to be stationary with regard to both the spin 
up and the spin down density, leads to effective potentials which are different 
for the two spin directions due to differences in the exchange and correlation 
potential, 

= ^('') + / I (29) 

Hence the simplest forms^of spin polarized calculations treat the spin up and 
spin down electrons separately and for every iteration in the self-consistent loop 
one solves a Kohn-Sham equation for both spin directions. The spin up and spin 
down densities are then calculated by occupying the N lowest (spin up or spin 
down) eigenvalues of the separate two Kohn-Sham equations. Since for a given 
17g(r) which may be different from Hgg(r) there may be more spin up states, eJ 
than spin down states, Ef, which have an energy lower than the highest occupied 
state (the Fermi level-£lF) it is clear how spin polarization might occur. With a 
self consistent spin and magnetization density the magnetic moment is calculated 
as f m{r)(Pr (in Bohr magneton units) and the total energy may be calculated 
from (26). 

4 Solving the Kohn-Sham Eqnations: Bnlk 

We have shown in the previous subsection that the equations to be solved for 
calculating the magnetic properties of a solid from first principles are (27)- 
(29). Before entering the details of one of the most used methods, the Linear 

® We will discuss below, in connection to the section about orbital moments and rela- 
tivity, complications which makes this scheme somewhat more involved. 
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Muffin-Tin Orbital (LMTO) mothod, we note that due to the symmetry of bulk 
materials a number of simplifications evolve. This discussion can also be found 
in, for instance, the textbook by Ashcroft and Mermin [26]. 

First of all one normally assumes in a bulk material that the potential which 
enters (27) is periodic, i.e. ^r) = "^(r + R), where R is a translation 

vector ( a Bravais lattice vector) of the solid. This periodic boundary condition 
leads to Bloch’s theorem [26] which states that as an effect of the periodicity 
of the bulk material the onc-clcctron wave function must obey the following 
condition. 



r/-W)(r + R) = e“^'^^Ji^)(r), (30) 

and we note that a vector k has been introduced. This is a vector of reciprocal 
space® and due to the translation symmetry one has only to consider k-vectors 
which lie inside the first Brillouin zone when looking for solutions to (27)-(29) 
[26]. In addition one can solve the (27)-(29) for each k-vector being separate and 
independent of the others. However, the dependence of the one-electron wave 
function on k makes the calculation of the one-electron density somewhat more 
complex since we have to include a sum over all possible k-vectors, and (28) is 
in a crystal replaced by 



= (31) 

i k 

In a similar way one often needs to sum all the possible Kohn-Sham eigenvalues 
Ei (equation (16)) to be used for calculating the total energy, (26). This is 
needed since one often writes. Top = Esum — J Veff(r)n(r)d^r). The sum of the 
eigenvalues, Egum, is often referred to as the eigenvalue sum and it is calculated 
from 

Asum = ^i,k- (32) 

i k 

In principle all k-vectors inside the first Brillouin zone should be considered in 
the sums above, but since this number is enormous one would like to replace 
the sum with an integral. However, if one does not have an analytic dependence 
of, for instance, Ei of k, we must find ways to approximate (32). In order to do 
this it is useful to introduce the concept of density of states, DOS. A derivation 
of the DOS may be found in most textbooks on solid state physics and it is not 
repeated here. Instead we quote the result, that the DOS can be calculated from, 

D{E)=Y,^ j^J{E-E,^)dPk. (33) 



Reciprocal space is spanned by the vectors Gi, defined as Gi • Rj = 2Tv6ij, where V 
is the volume of the primitive cell of the Bravais lattice. 
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With this definition of the DOS one can calculate the eigenvalue sum from 






/ ED{E)dE. 

' —OO 



(34) 



4.1 Different Types of k-Space Integration 

Having defined the DOS in (33) and the eigenvalue sum, in (34) we are now 
ready to discuss different ways to approximate the integral over the Brillouin 
zone (BZ), which is necessary in numerical methods where one does not have an 
analytical expression of Eik. First we write the eigenvalue sum as, 



E., 



/ OO If 

Ef{E)^ j^J{E-E,^)d^kdE 



(35) 



where f{E) is a step function which attains the value one below the Fermi energy 
and zero above. Usum is numerically very sensitive to the k-point sampling and 
the choice of numerical method to perform the Brillouin-zone integration, BZI. 
We will discuss three different BZI schemes [27]. In all three cases a uniform 
mesh of k-points is used, distributed as to fulfill the symmetry of the space- 
group. The BZI can in all three cases be written as weighted sums over the 
bands, i, and the discrete set of sampled k-points, kj, with weight functions, 
Wji. In the so-called linear tetrahedron method [28], LTM, the uniform mesh is 
divided into corner-sharing tetrahedra. A linear interpolation of the eigenvalues 
is performed between the k-points belonging to one tetrahedron, resulting in a 
weight function Wji. The step function is used directly by using E^ as an upper 
limit in the energy integration. A modification of the linear tetrahedron method, 
MTM, was suggested by Blochl et al. [29] In the MTM the linear weights, Wji, 
are corrected by, dwij='^j, ^Dt(£'f) {Eu — Eij) , where T is an index for the 
tetrahedra and I is an index for the k-points at the corners of the tetrahedron 
r. Dx(Ep) is the contribution to the density of states from the tetrahedron T at 
the Fermi level and Eij is the eigenvalue of the k-point belonging to the 
tetrahedron T. The MTM corrects for the curvature of the energy band to leading 
order. Another way to perform the BZI is to use a Gaussian broadening method, 
GBM, which convolute each discrete eigenvalue with a Gaussian function of 
width W. This method and the related Fermi-Dirac broadening method are very 
popular in total energy methods since they lead to a fast and stable convergence 
of the charge and spin densities. The GBM can be seen as a truncation of a 
complete series expansion of a d-function in terms of Hermitc polynomials, i/„, 
with a Gaussian weight function [30]. Then the step function, f{E), can be 
written as 



^ E 

fiE) = fo{E,W) + J2^nH2n-li- 

n—1 



Ef 



R-Ejr x2 
W 2 



W 



le 



(36) 
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Fig. 1. Calculated total energy of fct Ni as a function of the number of k-points use 
for sampling the BZ. Three different sampling techniques were used, LTM, MTM and 
GBM (see text). 



where fo{E, W) = |(1 — er/( )) {erf stands for error function) and An are 
coefficients which may be calculated analytically. In practical calculations one 
has to truncate the sum in the equation above and the resulting “step” function 
is called /at(£’;FF), if N terms are kept in the sum. If the function which is 
“smeared” by f]s[{E;W) (for instance the DOS times the energy - see equation 
(34)) can be represented by a polynomial of 2N there is no error involved in the 
truncation [30]. 

An example of how the different BZ integration methods work is show in Fig. 
1 where the total energy of fct Ni (c/a ratio of 0.945) is presented as a function of 
the number of k-points in the irreducible wedge of the BZ [31]. The calculations 
were based on a full-potential linear muffin-tin orbital method, described in the 
next section. Note from Fig. 1 that the LTM converges much slower than the 
MTM and GBM. Notice also that the GBM does not converge to the same value 
as the other two methods which seem to converge to the same value. This is due 
to the fact that in the GBM the sum over Hermite polynomials is truncated, 
resulting in an approximate step function. However, the error in the energy 
differences is much smaller [31,32]. 



4.2 The FP LMTO Method 

We have outlined above a number of concepts which appear due to the trans- 
lational symmetry of a solid, i.e. the Kohn-Sham equation must be solved for 
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a number of k-vcctors which, for a given cycic in the seif consistent ioop, may 
be treated as independent of each other. In addition one has to find ways to 
approximate the k-space summation of the eiectron states and we have given 
exampies of how one may do this. We are now ready to tackie the toughest part 
of the probiem, naineiy to solve (27). One approach is to expand the (unknown) 
one-electron wave function in a set of (known) basis functions as, 

V’i.k(r) = ^c;ikX;k(r). (37) 

i 

The sum in the equation above is truncated after sufficiently many basis fun- 
ctions have been included and the coefficients c/ik are, via the Rayleigh-Ritz 
principle [16], determined from the following secular equation, 

^^[Hw - £'ikC’/i']c(ik = 0, (38) 

i 



where 

Hw = [ Xik{'r)[-^ +Vj^^'’]xi'k{r)d^r = [ Xik{r)hef[Xi'kir)(fr (39) 
JUc ^ JVc 

and 



Ow = f xik{r)xi'k{r)(fr, (40) 

JUc 

where the integral is over the unit cell (Uc). Once Hw and Ow have been eva- 
luated the eigenvalues, (z = 1 — imax), are determined by [16] 

det \Hw - Eii^Owl = 0, (41) 

a standard numerical problem, which may be solved by existing software. 

4.3 Defining the LMTO Basis Functions 

The difficulty is now to choose a basis set which is flexible and converges fast, i.e. 
as few basis functions as possible are needed to represent with sufficient accuracy 
a given eigenfunction, One efficient basis set is comprised of so called 

linear muffin-tin orbitals [10], which may be used in a full-potential mode, as 
described by Wills [33] or with the use of the atomic sphere approximation [10]. 
Both these methods arc reviewed in other chapters of this book[34,35], hence we 
give only some of the details here. 

Let us note here that the FP-LMTO method is defined using a base geo- 
metry, which is the usual construction of muffin-tin spheres centered around 
the atoms and a region outside these spheres, called the interstitial region. In 




First Principles Theory of Magneto-Crystalline Anisotropy 261 



the muffin-tins, the basis functions, electron density, and potential are expan- 
ded in spherical waves, whereas in the interstitial, the basis functions, elec- 
tron density, and potential are expanded in Fourier series. The calculation of 
the Hamiltonian and overlap matrix elements thus involves replacing the in- 
tegrals in (39) and (40) with two parts, one coining from the muffin-tins and 
one coming from the interstitial. To be specific we break up integrals of /(r) as 
/u, f{r)d^r = f{r)d^r + f(r)d^r. 

The basis functions used in the LMTO method are best described as augmen- 
ted (or rather, inside the muffin-tin spheres, replaced) Bessel functions. These 
functions arc described mathematically as. 



JCi{K,r) 






ri£{Kr) — iji{nr)K^ < 0 
n^{Kr)K^ > 0 



(42) 



1Cl{k,y) = JCfXK,r)yi,{r) (43) 

Jl [k, r) = Ji{K, t)Tl (f ) (44) 

Je{K,r) = ]e{Kr)jK^ ( 45 ) 

In the equation above we have used, 

ye„,{r) = i%rr,.{r) (46) 

Q„(f) = ^471^ + 1 Yem{^) (47) 

Qm(f) = /^^(f), ( 48 ) 

where T is a spherical harmonic. 



4.4 The Muffin— Tin Matrix Element 

The evaluation of matrix elements over the muffin-tin region requires a mathe- 
matical description of the basis function used in this part of the crystal. In case 
the effective potential in (24) is a constant (a purely hypothetical situation) the 
eigenfunctions can be written as plane waves or as a linear combination (a so 
called Bloch sum) of spherical Hankel or Neumann functions. To be specific the 
expression is (for a mono-atomic solid) 

Xi(k,r) = |r - R|)Trm(r - R), 

R 



(49) 
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where /C is a basis function defined in (42). Since part of the crystal has an 
effective potential which is rather constant, and does not vary much (in the 
interstitial) it is meaningful to use the above quoted functions as basis functions 
in the interstitial region. However, before describing how these functions are used 
to calculate (39) and (40) let us first deal with the muffin-tin region. We first 
note that (49) 

XL(k,r) = [K.e>{n,Sr)5{R,mL,L') (50) 

R L' 

+ i7l' (^; s)Bl^i J^{k, R), ] 

can be rewritten inside a muffin-tin sphere at R=0 as a multi-pole expansion 
which yields, 

Xi (k, r) = ^ ^L' (f )(/C,, (k, s)5(L', L) + Jl' (k, s)Blcl(k, k)). (51) 

L' 

This function is continuous and differentiable at the muffin-tin spheres. Since 
the effective potential around the nuclei, inside the muffin-tins, is more or less 
spherically symmetric and atomic like an efficient choice of basis set is to replace 
the Bessel and Neumann functions in (50) with numerical functions that are 
solutions to a Kohn-Sham like differential equation. However, the numerical 
functions are only determined inside the region where the effective potential is 
close to being spherical, i.e. inside the muffin-tins. They are calculated from 
'P = aip + bip, where _ e,y)tp = 0 and pi is the energy derivative of ip. 

The constants a and b are determined to ensure continuity and differentiability. 
Hence the Bessel and Neumann functions are replaced, in a continuous and 
differential way, with numerical functions. The basis function to be used in the 
muffin-tin region is then, 

XL(k,r) = '^yi^f{r){^i>{K,Sr)S{L\L) + s)BL^L(^, k)). (52) 

With this basis function, matrix elements of the Hamiltonian and overlap can 
be evaluated over the muffin-tin region. Added to each such matrix elements is 
then the contribution from the interstitial region, which is described next. 

4.5 The Interstitial Matrix Element 

The basis function in the interstitial region is described by (49). However, since 
it has been found to be efficient to represent the effective potential and charge 
density as Fourier series, it becomes computationally efficient to make use of a 
Fourier series also for the basis function. This is possible since a basis function 
to be used in a crystalline environments can be written as a periodic function 
multiplied with a plane wave, and one can of course express the periodic function 
in a Fourier series. Hence one makes use of, 

x4k,r) = ^XGe‘e^+°)-, (53) 

G 
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in the interstitial region from which the interstitial contribution to the Hamil- 
tonian and overlap matrix elements are evaluated. Since the potential in the 
interstitial region is represented as a Fourier series the evaluation of the matrix 
element of the Hamiltonian (and the overlap) becomes relatively simple, invol- 
ving a sum of Fourier components. In practice it becomes faster to perform this 
integral via a combination of inverse Fourier transforms and Fourier transforms, 
which is not described here. 



5 Magneto— Crystalline Anisotropy of Selected Materials 

5.1 General Remarks 

In the remaining part of this chapter we will give examples of how the theories 
outlined above can be used to calculate the magneto-crystalline anisotropy, as 
well as spin and orbital moments. However, before entering the details of this 
section which describes the magneto-crystalline anisotropy (MAE) as well as 
spin and orbital moments of selected materials, it should be mentioned that van 
Vleck pointed out, already in 1937 [36], that the spin-orbit interaction is res- 
ponsible for the coupling of the spin to the lattice, which then gives rise to a 
magnetic anisotropy in a magnetic solid. Hence, much of the physical understan- 
ding of the mechanisms behind the MAE of 3d elements have been known for a 
long time. Actual calculations of the MAE have been thwarted because efficient 
methods for solving the Kohn-Sham equations, in combination with fast com- 
puters, are only recently available. Pioneering and important work in the field of 
MAE was also made by Brooks [37] and Kondorsii and Straub [38] where techni- 
cal details of, for instance, problems with BZI were analyzed. We are interested 
here in the electronic contribution to the MAE, provided by the coupling of spin 
and orbital angular momenta, via the spin-orbit coupling. There is also another 
contribution to the MAE which is called the shape anisotropy (described in the 
following section). However, in the theoretical description of the MAE below we 
shall focus on the contribution given by the spin-orbit coupling. Hence, when 
comparing theory and experiment we have, unless otherwise explicitly stated, 
removed the shape anisotropy from the experimental data. 

5.2 Shape Anisotropy 

The shape anisotropy results from the magnetic dipole interaction. To calculate 
it we can consider a lattice of magnetic dipoles with magnetic moment /r^. The 
energy of the dipole interaction can be written as [ 2 ] Adipoie = — 5 J2i l^i' 
where is the field intensity at lattice point i due to all the other dipoles. This 
interaction can be written as [39,40], 



E, 



dipole 



c2 -R.p F 



(54) 
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where is a vector describing the lattice point i. Duo to the long range nature 
of the magnetic dipole interaction the sum in (55) depends on the shape of the 
sample, and this leads to the shape anisotropy. For thin films, for instance, it 
always favors in-plane moments. 

5.3 Orbital Moments and Orbital Polarization 

An important part of magnetism, especially when considering the MAE, is the 
orbital contribution to the magnetic moment. Just as the spin moment can be 
written as the expectation value of Sz using the wavefunctions in (37) one can 
calculate the orbital moment from [41], 



= ■ (55) 

ki 

The expectation value in (55) is non-zero only if the relativistic spin-orbit cou- 
pling is included in the calculation and if the spin degeneracy is lifted. Via the 
spin-orbit interaction there is now a coupling between spin and orbital angu- 
lar momentum and it is this interaction which is responsible for the electronic 
contribution to the MAE. Hence the orbital moment and the electronic contri- 
bution to the MAE have in common the fact that they both are a consequence 
of the spin-orbit interaction and below wc will show that one can sometimes 
write an explicit relationship between the MAE and the orbital moment. From 
the arguments above it is clear that any theoretical method which attempts to 
calculate the MAE or orbital moment must consider the spin-orbit interaction. 
This can either be done by solving the Dirac equation, by use of perturbation 
theory, or to diagonalise a fully relativistic Hamiltonian using basis functions 
from a non-relativistic Hamiltonian [10,42] (or more precisely scalar relativistic 
Hamiltonian, since so-called scalar relativistic corrections such as mass velocity 
and Darwin shifts [23] are normally included in most band structure methods 
anyway). Using perturbation theory is more approximate than the other two 
methods, which give almost identical results^. The latter finding in not too sur- 
prising since the same relativistic Hamiltonian is considered, only the form of 
the basis functions is different. Diagonalising a relativistic Hamiltonian with 
scalar-relativistic basis functions has the advantage of being relatively easy and 
scalar-relativistic codes may, without too much effort, in this way be modified 
to include also the spin-orbit interaction. This can either be done in a straight 
forward way [10,42], as will be described below, or by using the so-called se- 
cond variation approach [45]. The basic principle in these two implementations 
is however the same, and we describe only the first version here. If the effective 

^ This was found for instance when comparing orbital moments of Fe, Co and Ni 
calculated using these two different methods, but also by inspection of the DOS of 
Pu [44], a very heavy element where the spin-orbit interaction is approximately 1 
eV. 
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Hamiltonian in (39) is augmented to include also the relativistic spin-orbit in- 
teraction, ^relativistic ^ ^sraiar^ (where ^ represents the strength of the 

spin-orbit interaction) the secular equation in (38) is modified as follows, 

^ ^ ^ a' — 9 . (^ 9 ) 

I cr 

In this equation matrix elements of the form 



(xi<7k(r)cr|Cl-s|(j'x/v'k(r)> 



^ ^z^z)\ ^ X/V'k(^)) 
' ' Uxs +l ''' 






(57) 



must be evaluated. Since the basis functions involve spherical harmonic fun- 
ctions, spinors, (T, and a radial component, these matrix elements are rather 
straight forward to calculate. Diagonalisation of (56) now yields eigenvalues 
which are not pure spin states. Instead, the wave function of a given eigenvalue 
is written as a linear combination of states with different I and a character. 

A practical complication of (57) is that the Hamiltonian matrix is doubled in 
size, making the numerical diagonalisation procedure slower. A further compli- 
cation of including both the spin-orbit coupling and a lifting of spin degeneracy 
is that the symmetry (of the crystal) is reduced. Consider for example a cubie 
erystal, such as bcc Fe. Normally there are 48 point group operations which 
leaves the lattice invariant. However, if magnetism is considered and if the ma- 
gnetization is coupled to the lattice via the spin-orbit interaction, the so-called 
double group [46] has to be considered, and the effect of the time reversal opera- 
tor has to be taken into consideration. If the magnetization of our cubic lattice is 
pointing along the crystallographic z-axis we can, for instance, not perform a 90 
degree rotation around the x- or y-axis, since the magnetization direction then 
is rotated out from the z-axis and the system is not invariant. However, a 90 
and 180 degree rotation around the z-axis is allowed. Also, a 180 degree rotation 
around the x-axis flips the magnetization direction from the z-direction to the 
z-direction and in itself such a symmetry operation is not allowed. However, the 
product of this rotation and the time reversal operator (which changes the sign 
of the magnetization) is an allowed operation and this element of the double 
group does leave the crystal invariant. Our example of a cubic lattice when the 
magnetization is pointing along the z-direction has 16 allowed symmetry opera- 
tions. Hence, the lattice becomes tetragonal in symmetry, an effect which we will 
discuss below in connection with calculations of magneto-striction. At present 
we simply note that the reduced symmetry simply means that the irreducible 
part of the BZ is larger, and that more k-points need to be sampled in order to 
have a well converged total energy. 

Since the spin-polarization (spin pairing) is treated via a spin-dependent 
effective potential, such as (29), and the relativistic spin-orbit interaction is 
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treated via (57), the physical mechanisms behind Hund’s first and third rules, 
respectively, are (at least approximately) included. One can at this stage ask 
the question why total energy calculations based on different approximations of 
DFT ignore Hund’s second rule? The answer to this question is of course that 
an exact formalism for including effects responsible for Hund’s second rule is 
lacking at present. However, it has been pointed out [47] that an approximate 
way to incorporate Hund’s second rule in total energy calculations, using DFT, 
is to add to the LDA (or GGA) energy functional a term, 

EoP = (58) 

cr 

In this equation E^ is the so called Racah parameter (note that this parameter 
is normally denoted B for d-electrons and E^ for f-electrons), which may be 
calculated from Slater integrals; and F® [48,49]. The correction in (58) 

is normally referred to as the orbital polarization (OP) correction. This form 
for including electron-electron interactions comes from the finding that a vector 
model, involving interactions of the form li ■ Ij between electron pairs [50,51], can 
be used to calculate the lowest energy multiplet. In fact an explicit expression 
for electron-electron interactions having a term li ■ Ij, is quoted both by van 
Vlcck [50] and Ballhauscn [51]. This term in the electron-electron interaction 
was also used by Norman et al. [52] to give a correction term which is similar to 
the form in (58). Summing the interaction, k ■ Ij, over electron pairs as well as 
replacing the average of this interaction with the sum over average interactions, 
Izilzj, gives rise to an expression for orbital correlations given by (58) [47]. The 
approximation of replacing li ■ Ij with the average is in the spirit of replacing the 
spin pairing energy, with a Stoner expression Szi Ylj ^zj = ^z- 

Since this form of the energy is absent in the LDA or GGA energy functionals one 
must add the energy of (58) to these total energy functionals. The interactions 
which arc responsible for Hund’s second rule, which in physical language are a 
reflection of that states with different angular momentum have different angular 
shape and hence a different Coulomb interaction, are now in an approximate 
way included in the energy functional. 

To illustrate the importance of (58) we show in Fig. 2 the energy correction 
corresponding to (58) for the 4f electrons of the lanthanide atoms (or ions). Spe- 
cifically Fig. 2 shows equation (58) neglecting the effect of F®, hence illustrating 
only the angular behavior of the correction. The correction in (58) is compared 
to the exact values [53], which are calculated from the energy difference between 
the lowest atomic multiplet of the /" configurations and those corresponding to 
the Grand Barry Centre (an average of the multiplets). This energy difference in- 
volves of course also spin-pairing energy and spin — orbit interaction, but shown 
in Fig. 2 is only the part which depends on the orbital angular momentum. To 
get a feeling for how important the correction in (58) is, we note that for a 
rare-earth system the value of F® is ~5 inRy which means that the correction 
in (58) can be as large as “100 mRy. From Fig. 2 we note that the agreement 
between the approximate form of the orbital part of the electron-electron inter- 
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Fig. 2. Angular behavior of the electron-electron interactions of the f-elements, as 
given by Nugent [53] and from equation (58). 



action, given in (58), and the exact values given by Nugent [53] is rather good. 
The largest disagreement is found for an f-occupation close to 1, 6, 8, and 13, 
since the exact values should be zero here (there is for instance no spin- and 
orbital pairing energy of a single electron), whereas the approximate correction 
does yield a non-zero correction. This is analogous to the fact that LSDA gives 
a spin-pairing energy of a single electron system. 

The derivation of (58) was made having nearly localized electron systems in 
mind [47] and for other systems other corrections should be tried. We have pre- 
viously mentioned the corrections of Norman [52], but there is also a third form 
suggested by Severin and co-workers [54] . We will not describe all the proposed 
corrections in detail here, but instead quote some results of calculations using 
(58), in order to illustrate the importance of these types of corrections, not only 
for f-electron systems but also for d-electron systems and in particular for orbi- 
tal magnetism. In Figs. 3 and 4 we show the calculated spin and orbital moments 
[55], respectively, of Fe, Co and Ni, including alloys between these elements. The 
calculations were using a standard LSDA calculation (with spin-orbit interac- 
tion), as well as the correction in (58). In the hgurcs experimental values arc 
also shown, and we note that an improved agreement between theory and expe- 
riment is consistently found for the orbital moment, when the OP correction is 
used. This is encouraging as well as reasonable, since the correction is supposed 
to include an interaction which is present in an interacting electron system. In 
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Fig. 3. Calculated and experimental values of the spin moments of the 3d elements, 
Fe, Co and Ni, as well as for alloys between them. 



addition we also observe that the orbital moments show a rather irregular beha- 
vior and as discussed by Soderlind [55] this is due to band hlliiig effects. Since 
the size of the orbital moment is to a large degree caused by a redistribution of 
electron states around the Fermi level, [56,57] it generally scales to some degree 
with the value of the DOS at Ep. Hence a large DOS at E-p normally results in 
a larger orbital moment and the irregular behavior of the orbital moment is to 
some degree a reflection of an irregular behavior of the DOS at Ep, for these 
alloys. In addition the crystal symmetry is important since hep Co is found to 
have the largest orbital moment. It can be argued from perturbation theory that 
for a cubic material the influence of the spin-orbit coupling strength, enters 
as For non-cubic materials the dependence is stronger, and this generally 
results in larger orbital moments. 



6 MAE of 3d Elements 

6.1 Fe, Co and Ni 

The first example of calculated MAE values which we would like to discuss is at 
first glance the simplest, namely the MAE of the ferromagnets Fe, Co and Xi. 
However, due to that the MAE is extremely small for these elements, at least for 
bcc Fe and fee Ni (of order /xeV) the task of calculating this from first principles 
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Fig. 4. Calculated and experimental values of the orbital moments of the 3d elements, 
Fe, Co and Ni, as well as for alloys between them. Calculations with (OP) and without 
(SO) the orbital polarization term are shown. 



is enormously difficult, where especially a high accuracy of the computational 
method is needed as well as a very dense sampling of the BZI. For this reason 
different values of the MAE of bcc Fe, hep Co and fee Ni have been reported in 
the literature. Some of the most recently calculated [58] values of the MAE are 
listed in Table 1. 



Table 1. MAE of Fe, Co and Ni in different crystal structures. All units in /reV. The 
MAE is calculated as the difference between the axis; 001-111 for the bcc Fe, 001-100 
for the hep Co, 001-111 for the fee Co and finally 001-111 for the fee Ni. 



MAE 


bcc Fe 


hep Co 


fee Co 


fee Ni 


ASA-force theorem [40] 


-0.5 


16 


- 


-0.5 


ASA-total energy [60] 


-2.6 




2.4 


1.0 


FP-total energy [58] 


-0.5 (-1.8) 


-29 (-110) 


0.5 (2.2) 


-0.5 (-0.5) 


experiment 


-1.4 


-65 


1.8 [59] 


2.7 



In the Table 1 we quote results obtained using the atomic sphere approxima- 
tion (ASA) where spherical potentials in overlapping spheres are used to replace 
the actual effective potential of the lattice (in equation (29)) in combination with 
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Fig. 5. Calculated and experimental values of the MAE of bcc Fe (001 vs. Ill direc- 
tion), hep Co (0001 vs. 100 direction), fee Co (001 vs. Ill direction) and fee Ni.(001 vs. 
Ill direction). Calculations with and without the orbital polarization term are shown. 




“I 1 1 1 1 1 r 

0,88 0,90 0,92 0,94 0,96 0,98 1,00 



c/a ratio 



a minimal basis set. This approximation is estimated to give an error in each 
calculated eigenvalue of a few mRy’s [10]. The full potential (FP) calculations 
make no approximation to the shape of the charge- or spin density as well as the 
shape of the effective potential (equation (29)) and as described above makes 
use of a double or sometimes even a triple basis set. One may note from Table 1 
that a variety of signs and sizes of the MAE may be obtained for Fe, Co and Ni, 
depending on which approximation one uses. The calculations of Ref. [40] seem 
to be well converged with respect to the number of k-points used, as do the cal- 
culations of Ref. [60[ . Since both these methods use the ASA the disagreement 
in the MAE of bcc Fe and fee Ni must be caused by the use of the so called force 
theorem® in Ref. [40]. This finding is consistent with the analysis of Ref. [62] 
where it was argued that for cubic systems the applicability of the force theorem 
may be limited. In the Table 1 we have quoted a value of 16 yueV for hep Co, 
using ASA and the force theorem. This actually corresponds to the wrong easy 
axis. However, this value was obtained using an spdf basis set, whereas an spd 

® The force theorem [61] states that the total energy difference between two different 
magnetization directions may simply be calculated from the difference in the eigen- 
value sum of the the two magnetization directions, using the same effective potential 
for the two directions. This approximation is correct to first order changes in the 
charge and magnetization density. 
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basis set gave a value of -29 /xcV, i.e. the correet easy magnetization direction. 
One can possibly draw the conclusion from this finding that the MAE of hep 
Co is hard to converge with a minimal basis set. Unfortunately no total energy 
calculation using the ASA [60] was reported for hep Co so that one can not 
examine if the poor convergence in the number of basis functions is connected to 
the use of the force theorem or not. The total energy, full-potential calculations 
did report on convergence tests with respect to the number of k-points, the 
width of the Gaussians used in the GBM for the BZI and the basis functions 
used (however, mostly for Ni). In fact the convergence of the double basis set 
(which is a standard size of the basis set of FP-LMTO calculations) was tested 
by a calculation which adopted a triple basis set (i.e. three s basis functions, 
three p basis functions and three d basis functions, each connecting to an enve- 
lope function with a unique kinetic energy) . Since the full-potential calculations 
have the most flexible basis set as well as a more exact effective potential (29) 
these results must be considered as the most accurate of the ones in Table 1®. 
If we now examine the MAE values of Table 1, using the full-potential method, 
we find that the correct easy axis is found for all elements except fee Ni. We 
also note that including the OP influences the size of the MAE quite strongly, 
as it did for the orbital moments in Fig. 4. In order to show this dependence 
more clearly we compare in Fig. 5 different calculated MAE values (using the 
FP-LMTO method) with experimental data. Note that for bee Fe and fee Co 
including the OP correction improves the agreement with experiment, whereas 
for hep Co the experimental value is between the two different theoretical va- 
lues. For fee Ni the OP correction has a very small effect on the MAE and the 
001 axis is calculated to be the easy axis, in disagreement with observations. As 
we shall see below, where more examples are given, first principles calculations 
normally reproduce the correct easy axis, even if the size of the MAE may de- 
viate somewhat. The fact that first principles calculations, based on the LSDA, 
arc incapable of providing the correct easy axis for fee Ni is probably connected 
to deficiencies in the form of the exchange and correlation potential in LSDA. 
From Table 1 and Fig. 5 we also note that the effect of crystal symmetry has a 
large influence on the size and direction of the MAE, since Co in the fee phase 
has a much smaller value of the MAE compared to the value of the hep phase. 
This can be under.stood from perturbation theory where, as stated, it is known 
that due to the high symmetry of the cubic crystal structure the influence of the 
spin-orbit coupling strength ((^) on the MAE enters as Since the value of ^ 
is rather small for Fe, Co and Ni, it follows that in cubic phases the magnitude 
of the MAE is tiny. For lower symmetries of the crystal a stronger interaction of 
^ on the MAE is present, resulting in larger MAE values. 

® Although this statement was debated in Ref. [60] we maintain that these arguments 
are not valid and the tests made to compare the two methods [60] are of poor quality. 
Recent work by P. James [63] show that a better converged calculation of hep Co 
yields a MAE value of ~ 20 /reV when the OP correction is omitted, with the 001 
axis as the easy direction. 
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6.2 Effects of Straining the Crystal Structure 

In this section we discuss theoretical results of the MAE and spin and orbital 
moments of tctragonally strained Fe (bet), Co (fet) and Ni (fct). Many studies 
of orbital magnetism and MAE are devoted to Fe, Co and Ni as over-layers on a 
substrate with a small in-plane lattice mismatch. This causes a strain in the over- 
layer material (Fe, Co or Ni) for thicknesses sometimes up to 50 atomic layers. 
As a rule of thumb one can estimate this strain by equating the in-plane lattice 
parameter to that of the substrate, then the out-of-plane lattice parameter of the 
over-layer is adjusted so that the volume/unit cell of the over-layer is constant 
and the same as for the elemental form of the over-layer material. The effect of 
strain on the MAE can, as we shall see in a moment, be large and since many 
experimental studies are devoted to studying this situation it is important to 
have a theoretical understanding of how crystallographic strain and MAE are 
connected. 

In this section we have chosen one primary example for which there are both 
theoretical and experimental data available; Ni on Cu (001) [64]. A few recent 
LEED studies of Co and Ni films on Cu(OOl) [65,66]have shown that these films 
can be grown epitaxially in a face centered tetragonal (fct) structure which de- 
viates with a few percent from that of the fee structure. In addition these films 
have been shown to have an out-of-plane magnetization for film thicknesses 
above ~7 atomic mono-layers [67] and an in-plane magnetization for thinner 
thicknesses [68]. This is a rather unusual behavior since in most systems the 
shape anisotropy will be the dominating term in the thicker limit and it always 
favors the in-plane magnetization. In many epitaxially grown magnetic films it is 
known that the magnetic anisotropy energy, MAE, (per atom) of the system can 
be expressed by the empirical formula, E = Ey + where Ud is the number 
of atomic layers in the film and E^ and Eg are the so-called volume and surface 
(interface) contributions to the magnetic anisotropy energy, respectively. Nor- 
mally the surface contribution (Eg) is much larger than the volume contribution 
(Ev) so that for thin film thicknesses the surface dominates and determines the 
easy axis magnetization. Normally the volume contribution is assumed to be a 
result of the material in an undistorted structure (e.g. fee) and hence is assumed 
to be small. For instance the volume contribution of fee Ni is small compared 
to the surface contribution. Hence only two terms need be considered, the sur- 
face contribution to the MAE and the shape anisotropy. With increasing film 
thickness the surface contribution becomes less important since it scales with 
the inverse of the film thickness, so that eventually the shape contribution domi- 
nates. Since this contribution always favors an in-plane magnetization, a change 
of magnetization direction as a function of increasing him thickness will be from 
out-of-plane to in-plane, in contrast to the experimental result for Ni on Cu. It 
was speculated [67] that the unusual behavior of the Ni/Cu system is driven by 
a large and positive volume contribution, Ev, to the magnetic anisotropy energy 
in the Ni him which occurs due to the tetragonal distortion of Ni grown on a 
Cu(OOl) substrate, in combination with a negative surface contribution. Eg. We 
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Fig. 6. Calculated values of the MAE (001 vs. 110 direction) as a function of crystal- 
lographic strain of fct Ni. Calculations with and without the orbital polarization term 
are shown. Also shown is the experimental value of Ref. [67]. 



will demonstrate in this section that this indeed is the case and that a breaking 
of the crystal symmetry enhances the value of the MAE dramatically. 

Using the full-potential LMTO method, for Ni in a face centered tetra- 
gonal (fct) structure was calculated [64]. is here dehned as the difference in 
total energy, between the [110] and [001] magnetization directions, per atom. In 
Fig. 6 Ey is plotted as a function of tetragonal strain, or equivalently c/a-ratio, 
of the fee lattice (in reality it is, due to the strain, an fct lattice), assuming a 
constant in-plane parameter (i.e. this distortion is not volume conserving) . (The 
in-plane lattice parameter, is designated 3.58 rA, which is the same as the value 
measured by FEED for thin Ni films on Cu [65]. The experimental lattice para- 
meter of bulk fee Ni is 3.52 rA). At c/a=1.0 the fct crystal has a volume which 
is larger than the volume of bulk fee Ni. For comparison, both the calculated 
Av with and without orbital polarization is shown. Note the large increase of 
Av due to the orbital polarization. It is found that the [001] direction is the 
easy direction for all c/a <1.0. Further, we notice that E^ is linear in c/a, to 
first order, especially when c/a is close to 1.0. This indicates that our distortions 
in the range c/a “0.88 to 1.0 are in the magneto-elastic regime. Notice also 
that in the calculations with only the spin-orbit interaction included, this linear 
behavior is not as pronounced. In addition it may be observed that due to the 
breaking of the cubic symmetry the value of the MAE becomes very large. In 



274 O. Eriksson and J. Wills 




c/a ratio 



Fig. 7 . Calculated values of the MAE (001 vs. 110 direction) of Ni, for strains along 
the Bains path. Calculations with and without the orbital polarization term are shown. 



the figure we have also displayed one experimental value obtained from FMR 
experiments [64]. In Fig. 6 we see that, for the experimental value of the c/a ratio 
(c/a=0.945, corresponding to the tetragonal distortion in Ni films on Cu(OOl)), 
the theoretical calculation gives = 140 /reV. This is in reasonable agreement 
with the measured value, £'v=60 [xeV. The most important thing to note is that 
theory and experiment agree on the sign of Ey and that they both show that it is 
substantial in tetragonal Ni. As previously discussed by Schulz and Babcrschkc 
[67], the large and positive E^ term is larger than the shape anisotropy for the 
same Ni film. This leads to a change in magnetization direction for film thickn- 
esses larger than a critical thickness of ~7 mono-layers (i.e. E^ dominates over 
the negative surface term, Es, and the shape anisotropy), which is consistent 
with observations. 

Since Fig. 6 demonstrates that a strain of the fee crystal structure influences 
the value of the MAE very strongly it is of interest to investigate the MAE over 
large regions of strain. The so-called Bains path, which connects the bcc and fee 
structures via a tetragonal strain, is especially useful here since one may calculate 
the MAE along the Bains path and observe how it is modified with respect to 
changes in the strain. For this reason we show in Fig. 7 calculated values of 
along the Bains path. The calculations were done in such a way that the volume 
is the same for all different c/a ratios. For c/a close to fee or bcc one observes an 
almost linear behavior as expected from magneto-elastic arguments. Since both 
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FeNi FePd FePt 

MnNi MnPt 



Fig. 8. Calculated values of the MAE (001 vs. 110 direction) of fct Ni as a function of 
calculated values of the orbital moment anisotropy. The behavior of (59) is shown as a 
straight line. 



fee and bee are eubic struetures, and therefore (for symmetry reasons) ean only 
have magnetie anisotropy eonstants of 4*^ order and higher, Ey must be small 
for e/a elose to 1.0 and Therefore the eurve must deviate from the linear 
behavior at some intermediate e/a ratio. In Fig. 7 it is seen that this happens 
when c/a is between 0.8 and 0.9. Further we notiee that Ey is positive in the 
interval -^<e/a <1 and negative for c/a >1.0 and <^- It seems likely that this 
information can serve as a prediction of the volume contribution to the MAE, 
of pseudomorphically grown tetragonal Ni films on any substrate (if possible to 
fabricate) . Due to the elasticity the volume of fct Ni will never deviate much from 
the volume of fee Ni and the curve in Fig. 7 should resemble the experimental 
reality. 

6.3 The Correlation between MAE and Orbital Moment 

An alternative to measuring the MAE directly (which could be difficult in certain 
cases) may be found from a simple relationship between the orbital moment 
anisotropy (OMA), which is defined as the difference in the orbital moment 
when the magnetization is pointing in two different directions and the total 
energy difference between these two directions, i.e. the MAE. Bruno showed [3], 
using perturbation theory and assuming that the exchange-splitting is larger 
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than the band width, that the following relationship holds between MAE and 
OMA, 



MAE = -^OMA, (59) 

where ^ is the spin-orbit parameter. Hence instead of measuring the MAE of a 
system one could measure the orbital moment when the magnetization is poin- 
ting in one direction and subtract the measured orbital moment of a different 
magnetization direction. Forcing the magnetization to lie in a specific direction 
may be obtained by an applied external field. Hence, if it is judged that the 
OMA is easier to measure than the MAE one could simply take the measured 
OMA and use (59) to estimate the MAE. In addition this relationship may be 
used to analyze the behavior of measured MAE values of compounds, interfaces 
and so on. Since the relationship between OMA and MAE relies on different 
assumptions it is of interest to investigate how well this relationship holds for 
different systems. We present one example here, Xi on Cu, and show that the 
relationship between MAE and OMA actually is linear, which is what (59) sug- 
gests. To illustrate this we plot in Fig. 8 the MAE of fet Ni (the same calculation 
as described in the section above) as a function of the difference in orbital mo- 
ment (OMA) when the magnetization is pointing in the 001 and 110 direction. 
Since the calculation of fet Ni was made for several values of the c/a ratio, se- 
veral different values of both the MAE and OMA may be calculated and it is 
these data which are shown in the figure. The relationship between MAE and 
OMA are also shown in Fig. 8, using a value of 105 meV for Overall, the rela- 
tionship between the MAE and OMA is roughly linear although for certain c/a 
ratios there are larger disagreements. This probably implies that the assumption 
of an exchange splitting which is larger that the band width is starting to break 
down. 



7 Selected Compounds 

There can be quite strong modifications of the MAE as well as spin and orbital 
moments when a 3d element is situated in a compound. A good understanding of 
the mechanisms which modify the MAE in one direction or the other is important 
since one then can start tuning material parameters in order to optimize the 
magnetic properties. We will outline here some observed trends and explain 
their origins. We illustrate this with some recent results by Ravindran et al. [69] 
on the FeX and MnX compounds (X=Ni, Pd and Pt). 

7.1 FeX and MnX Compounds (X=Ni, Pd or Pt) 

The MnX and FeX compounds crystallize in a layered tetragonal structure with 
two atoms per primitive unit cell. Experimentally the MAE has been measured 
for a large number of compounds of this type, some which are compiled in Ref. 
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Fig. 9. Calculated values of the MAE (001 vs 100) of FeX and MnX compounds (X=Ni, 
Pd or Pt). 



[69]. We will by no means report on all the details of the theoretical work of 
Ref. [69] but merely quote the results which are of largest importance for the 
discussion here. In Fig. 9 we show the calculated MAE of iso-structural FeNi, 
FePd and FePt as well as of MnNi and MnPt. This figure illustrates that for 
inter-metallic compounds the influence of the spin-orbit coupling of the ligand 
atom strongly influences the MAE. This finding was also observed for multi layer 
systems [70] . In particular one observes that a larger spin-orbit coupling of the 
ligand atom results in a larger value of the MAE. The reason for this is that the 
magnetism of the 3d atoms will induce a magnetic moment also in the ligand 
atoms, even if these atoms are non-magnetic as pure elements. 

In Fig. 10 the effect of the exchange-splitting of the 3d atom is illustrated 
and we observe that a larger exchange-splitting results in a larger MAE. Since 
the MAE is a result of spin-orbit coupling in combination with spin polarization 
the results in Figs. 9 and 10 are maybe not too surprising but it is not entirely 
obvious that there should be a rather smooth relationship between the MAE 
and exchange-splitting, as Fig. 10 shows. All details of the calculated results for 
all studied FeX and MnX compounds are reported in Ref. [69] and are not given 
here. However, we note that theory obtains the correct easy axis of magnetization 
for all these compounds [69], although the absolute values sometimes deviate 
from experiments with as much as 50%. 
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Fig. 10. Calculated values of the MAE (001 vs 100) of TPt (T=Ni, Pd or Pt). 



8 Surface and Interface Magnetism 

When an atom is close to an interface or a surface it will experience a diffe- 
rent environment compared to its elemental environment and as a consequence, 
the wave function overlap between atoms, the hybridization and the direct hop- 
ping will be different. In short, the electronic structure will be modified. In this 
section we will illustrate how this rearrangement of the electronic structure pro- 
duces magnetic properties such as spin and orbital moments and in addition how 
it modifies the MAE. These effects are possible to measure on an almost atomic 
level. For instance, the surface eontribution to the MAE and the spin and orbi- 
tal moments have been measured [66,67,71,72]. In addition to describing some 
arcetypal features of surface and interface magnetism we will illustrate with a 
few examples how, from an electronic structure point of view, one can use the 
knowledge of these modifications in order to influence the magnetic properties 
in a desired direction. 



8.1 Spin and Orbital Moments of Selected Surfaces and Interfaces 

The example to be discussed in this section is that of Co grown on Cu (001) 
[71]. This system is chosen since it is known experimentally that thin films of Co 
grow in a well characterized way (fee) for up to 50 atomic layers, which enables 
measurement of the thickness dependence of the spin and orbital moments. In 
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the experiments of Tisher et al. [71]. X-ray diehroism was combined with the 
sum rules [73], which connect x-ray diehroism to the size of magnetic moments. 
The advantage with this experimental technique is that it gives atom specific 
information about the magnetism. By studying the thickness dependence of the 
ratio between orbital and spin moment one can calculate the interface, bulk and 
surface contribution to this ratio. The so obtained ratio between orbital and 
spin moments are in Table 2 compared to theoretical values [71]. Note that both 
for the interface and the surface the ratio between orbital and spin moments is 
increased over the bulk value. Theory is in good agreement with this finding, 
and the observed trend with the largest ratio between orbital and spin moments 
for the interface and smallest ratio for bulk, is reproduced by theory. Table 2 
also lists the individual spin and orbital moments for the bulk, interface and 
surface atomic layers. As may be observed both the spin and orbital moments 
are enhanced for atoms which have a chemical surrounding which differs from 
bulk. However, the enhancement is larger for the orbital moment than it is for the 
spin moment. The enhancement of the spin moment may simply be understood 
from the fact that atoms which are close to the surface or close to the Co/Cu 
interface have a reduced number of nearest neighboring atoms. From a simple 
tight binding model, where the band width scales as the square root of the 
coordination number, the reduced bandwidth and enhanced spin moments at a 
surface or even an interface (if the interaction-hybridization between the atoms 
is weak over the interface) may be understood. 



Table 2. Spin and orbital moments for 1 ML of Co on Cu (001), bulk fee Co and Co 
surfaee (001). 



quantity 


1 ML Co/Cu 


(001) Co (fee, bulk) Co (fee, surfaee) 




0.261 


0.134 


0.234 


Ms(/Ub) 


1.850 


1.724 


1.921 


Mb/Ms (theo.) 


0.141 


0.078 


0.122 


Mb /M s (expt.) 


0.195 


0.078 


0.113 



In Fig. 11 we display as an example the DOS of bulk fee Co and the DOS 
projected on a Co atom for a mono-layer of Co on Cu (001). The narrowing 
of the DOS at the surface is clear from this figure. Hence, since reduced band- 
widths almost always (but curiously not always) produce enhanced spin mo- 
ments, the trends of the spin moments in Table 2 may be understood. Before 
entering the discussion about orbital moments we remark here first on the fact 
that even though the DOS at, for instance, a surface may become substantially 
more narrow compared to bulk, details of the hlling of the bands as well as the 
hybridization with the underlying substrate may in rare cases produce reduced 
moments. An example of this is Ni on Cu (001) [74]. Let us now turn to the 
enhancement of the orbital moment at the interface and at the surface. For the 
surface it has been pointed out that the crystal field is different and that hence 
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Fig. 11. Calculated density of states of bulk (hep) Co and a Co monolayer on Cu (001). 
The Fermi level is marked by a vertical line. 



the crystal field quenching is reduced at the surface [75]. This tends to enhance 
the orbital moments. Moreover, since the bands become more narrow at the 
surface the value of the DOS at Ep is enhanced compared to bulk producing 
larger orbital moments. Finally we observe that orbital moments appear only 
when one theoretically includes the relativistic spin-orbit interaction in a spin 
polarized calculation. A spin degenerate calculation, even if it includes the spin- 
orbit interaction, will yield a zero orbital moment. Thus, a spin moment which 
is reduced, approaching zero, produces an orbital moment which also is reduced 
and approaches zero. Conversely, increased spin moments, such as the ones for 
the surface and interface in Table 2, generally produce larger orbital moments. 
We have thus identified three mechanisms which for atoms at, or close to, the 
surface result in enlarged orbital moments. For the interface atoms all the above 
applies, and using these arguments one may also expect enhancements, at least if 
the interaction (hybridization) with the atoms across the interface is sufficiently 
small. Previously we discussed the simple relationship between MAE and OMA. 
Since the size of the orbital inomcnts often is larger for surfaces and interfaces 
it is likely that the orbital moment anisotropy (OMA) also is enhanced, from 
which it follows that the value of the MAE is larger than it would be in the 
bulk. In addition to this the symmetry is lowered and the spin-orbit interaction 
influences the MAE more strongly (than ^^) which also produces larger values 
of the MAE at surfaces and interfaces. 
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Fig. 12. Calculated OMA (full line, see text) and experimental magneto-striction co- 
efficient of FeCo (bcc, dashed-dotted line) and FeNi (fee, dashed line) alloys. 



Analysis of the MAE of different types of over-layers, for instance of Co 
on Au(lll) [76], Fe on Au(OOl) [77], and Co on Cu(OOl) [78]gives additional 
valuable information. For the case of Co on An(lll) [76] it was demonstrated 
that enhancement in the MAE could be understood from details in the DOS 
projected on the Co site, and in particular a large DOS at Ep with the ‘correct’ 
symmetry explained the large values of the MAE for certain thicknesses of an 
Au capping layer [76]. This way of explaining variations of the MAE was also 
reported by Pick and Dreysse [79] who argued that a large MAE is expected 
when a large value of the DOS (of XY and YZ character) is situated on Ep. 
Another important finding was also reported in Ref. [79], where the MAE of free 
standing mono-layers of Fe, Co and Ni was shown to have a minimum number 
of nodes as a function of band-filling. 



9 Magneto— Striction 

For strains close to the cubic phase, bcc and fee, the change in the MAE is 
linearly dependent on the strain, e (e=c/a-l). Using arguments from magneto- 
elastic theory, the MAE of small tetragonal distortions can be used to calculate 
the magncto-striction coefficient, Aqoi, as illustrated in Ref. [80] for fee Ni. A 
similar scheme has been used by Wu et al. [81]. The total energy can be written 
as the sum of an elastic and a magneto-elastic energy {Ee\ and Ume) which are 
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assumed to be quadratic respectively linear in small distortions, E^\ =C'e^ and 
£inie =cr B e. Here e is the volume conserving tetragonal distortion (e" 2(c/a- 
l)/3), Of is a constant which takes the value 1 for magnetization parallel to 
the tetragonal axis and for a perpendicular direction, and B and C are the 
magneto-elastic and tetragonal elastic coefficients, respectively. The magneto- 
striction coefficient, Aooi, is defined as the equilibrium distortion, i.e. where the 
total energy, Ef.\ + E-^^e , has its minimum, for the case with the magnetization 
along the [001] direction. By differentiation of the total energy with respect to e 
this turns out to be A = — ^. The uniaxial MAE is, for small distortions, related 
to the magneto-clastic energy as, Emae = = 3eCA. Hence one may connect 

the A and MAE values and compare their trends directly [32]. Hence, in order 
to study trends in the MAE as a function of, for instance, alloy concentration 
one may, as an alternative, study experimental trends in A or the trend in the 
OMA, and this is something we will do next [32]. Before entering the details of 
such a comparison we comment that a rigid band approximation works well for 
studying the trends in the MAE and OMA of alloys involving Fe, Co and Ni 
[32]. In Fig. 12 we compare the calculated OMA of bcc Fe and fee Co to the 
corresponding experimental magneto-striction coefficients, A. All analyses [32] 
indicate that it is sensible to compare these two properties, at least their trends. 
Note that the theoretical calculations reproduce the experimental trends very 
well. For the FeCo alloy (bcc) theory reproduces the drop in A with increasing Co 
concentration and the dip at a Co concentration corresponding to ~2.5 spin down 
d-electrons. The trends in A of the FeNi (fee) alloys is somewhat more intricate 
where A changes sign two times as a function of alloy concentration. However, 
theory and experiment agree rather well and we conclude that magneto-striction 
is a property which, at least concerning the trend, is accessible for state of the 
art, total energy calculations [32]. 



10 Summary 

To summarize, we have outlined some of the most central aspects of first prin- 
ciples calculations of the magneto-crystalline anisotropy, and the spin and or- 
bital moments of elements, compounds, interface and surface systems. One of 
the more commonly used methods for solving the Kohn-Sham equations, the 
full-potential linear muffin-tin orbital method is also reviewed, briefly. We give 
several examples of the accuracy of theory for reproducing the MAE, the spin 
and orbital moments, and the magneto-striction coefficients. A more detailed di- 
scussion of these effects can however be found in Ref. [63] . The most important 
results reviewed here is that the anisotropy of the orbital moment very often is 
connected to the MAE and the orbital moment may be influenced by a ‘suitable’ 
tuning of the electronic structure, via geometry, ligand atoms, crystal field split- 
ting and so on. A theoretical engineering of suitable magnetic properties, such as 
magnitude of the moment and its directional behavior is now starting to become 
feasible. By good knowledge of how the electronic structure, and especially the 
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DOS projected on a specific atom type, is modified for different crystallographic 
environments, it is possible to predict materials with novel magnetic properties. 
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Abstract. The ab-initio self-interaction corrected (SIC) local-spin-density (LSD) ap- 
proximation is elaborated upon, with emphasis on the ability to describe localization 
phenomena in solids. Two methods for minimizing the SIC-LSD total energy functio- 
nal are considered, one using an unified Hamiltonian for all electron states, thus having 
the advantages of Bloch’s theorem, the other one employing an iterative scheme in real 
space. Moreover, an extension of the formalism to the relativistic case is discussed. 
Results for NiO, cerium and cerium compounds are presented. For NiO a significant 
charge transfer gap is produced, in contrast to the near vanishing band gap seen in the 
LSD approximation. Also, the magnetic moment is larger in the SIC-LSD approach 
than in the LSD approach. For the cerium compounds, the intricate isostructural phase 
transitions in elemental cerium and cerium pnictides may be accurately described. A 
sizeable orbital moment for elemental cerium metal is obtained which, upon lattice 
expansion, is seen to reach the atomic limit. 



1 Introduction 

Density Functional Theory (DFT)[1] is a very powerful tool for performing ab 
initio electronic structure calculations for complex systems. It provides an exact 
mapping of a many-body electron problem which occurs in solids onto a one- 
electron problem. Instead of considering, for N interacting electrons in an exter- 
nal potential V(,xt(i'), the 3N-dimensional Schrodinger equation for the wavefun- 
ction !F(ri, T 2 , ra, ..., rjv), DFT expresses this many-body problem in terms of 
the electronic density distribution n(r) and a universal exchange and correlation 
functional of the density, Exc[n]. The task of solving the many-body problem is 
then reduced to hnding sufficiently accurate expressions for Exc[n] and then sol- 
ving the relevant one-electron Schrodinger equation with an effective potential of 
which the exchange-correlation potential is a prominent part. Generalizing DFT 
to the spin-polarised case, the spin-density functional theory (SDFT) allows to 
study magnetic systems, where the magnetization density is the order parameter 
of the theory, and appears explicitly in the exchange-correlation energy functio- 
nal, i.e. , ExJn,m]. 

H. Dreysse (Ed.): Workshop 1998, LNP 535, pp. 286-312, 1999. 

© Springer-Verlag Berlin Heidelberg 1999 




Self-Interaction 



287 



The local-spin-density (LSD) approximation to SDFT provides a simple and 
rather successful schcme[2]. This is owing to a simple and practical approxima- 
tion for £ia;c[n, m], where the exchange and correlation energy of the electrons 
in the solid is expressed in terms of the exchange-correlation energy per par- 
ticle, £a;c(n, m), of a homogeneous electron gas of homogeneous density n and 
magnetization m. Specihcally, in each point in space the electrons present are 
assumed to contribute an exchange-correlation energy given as if they were in a 
homogeneous electron gas at the local density n{r) and m(r): 

Exc^[n,m] = j n(r)e„c(n(r),m(r))dr, 

The simple function £a,c(n,m) is known with great precision, and hence allows 
for an accurate determination of the ground state energies and charge densities 
of any system. 

The LSD is a highly accurate approximation for systems where the electrons 
arc delocalized and travel ’fast’ through the solid. However, when the ’static’ 
electron-electron interactions become so strong that some electrons get localized 
on atomic sites in the solid, the LSD, as well as its gradient corrections, fail 
to describe the correct groundstate. This obviously means that the Exc[n,T^] 
can not anymore be adequately represented by the LSD. For when the electron 
slows down upon localization it starts responding to a different potential than 
the effective LSD potential. 

One of the most prominent examples where electron correlations are too 
strong to be properly treated within LSD is the high-temperature superconduc- 
tors, for which LSD fails to produce the antifcrroniagnctic and semiconducting 
ground states of the generic La 2 Cu 04 and YBa 2 Cu 3 0e compounds [3]. Other 
examples are the 3d transition metal oxides MnO, FeO, CoO and NiO, which 
are Mott insulators characterized by localized d-electrons [4,5]. These materials 
have antiferromagnetic order and large energy gaps although the metal d-shell is 
incompletely hlled. The LSD approximation does reveal the magnetic ordering 
but with somewhat too small magnetic moments and vanishing (for FeO and 
CoO) or very small (for MnO and NiO) gaps [ 6 ]. In addition, the persistence of 
the magnetic moments above the Neel temperature is difficult to explain in the 
Slater-Stoner picture of magnetism inherent in the LSD band picture. The failure 
of LSD in producing the correct gaps may be traced to the fact that the LSD 
eigenenergies do not have built in the large on-site Coulomb repulsion, which cha- 
racterizes the separation between occupied and unoccupied states [5,7,8]. LSD 
also fails to give a physically correct picture of trivalent cerium compounds, 
which are characterized by each Ce atom having a localized /—electron. Cerium 
is the first element of the Periodic Table to accomodate an /—electron. This 
4/— electron is peculiar in being spatially localized with a radial extent much 
smaller than that of the 5s and 5p semicore states, yet having an energy in the 
region of the valence 6 s and 5d electrons. Cerium metal has been widely studied 
over the years, both experimentally and theoretically, mostly due to its famous 
isostructural 7 — phase transition, occuring at a pressure of ~8 kbar. This 
phase transition is believed to occur as a consequence of a change in bonding 
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properties of the /—electrons. In the (high-volume) 7 phase the /—electrons 
behave as non-bonding localized inonients with a Curio- Weiss type susceptibi- 
lity, while the (low-volume) a-phasc is characterized by the f-electrons taking 
more active part in the cohesion, either by hybridizing into the Bloch states, as 
envisaged in the Mott transition model [9], or by forming a complicated Kondo 
lattice groundstate [10]. In an LSD calculation the total energy minimum is fo- 
und in the region somewhat below the a phase volume. Upon expansion of the 
lattice, the 7 — )■ a transition is signalled by the onset of magnetization around 
the observed lattice parameter for the 7 phase [11]. The calculated pressure is 
much too negative, however, since the /-electrons still contribute significantly to 
the cohesion in the spin-polarized phase. The loss of cohesion upon localization 
has been successfully described within the LSD in cases where the f-shell is half 
filled, as in americium [ 12 ]. 

The concept of an electron being ‘localized’ is not particularly well-defined. 
In a solid a periodic array of deep core states is equally well described by a set 
of atom centered Wannier states or by k-dependent Bloch functions. A unitary 
transformation connects the two representations, but the physical picture of a 
core state is that of a localized wavefunction. The observation of multiplet effects 
in photoemission experiments on 3d monoxides and rare-earth systems seems to 
suggest that the local description is most appropriate for such systems. On the 
other hand the great success of modern solid state theory in explaining most 
conventional metals, semiconductors and insulators leaves little doubt that the 
Bloch picture of normal valence electrons is the more fruitful one. 

The Hubbard model [13] has often been used to describe materials where lo- 
calization phenomena occur. The substantial Coulomb correlations on particular 
atoms, say on a Ni d® ion in NiO, induce an orbital polarization [5], which may 
well be described already in a mean-field treatment of the Hubbard model, as 
has been shown in the LDA-I-U approach [8,14]. However, the ‘localizing’ Hub- 
bard term is usually too large to facilitate an accurate description of delicate 
localization-delocalization transitions, as for example observed in rare-earth and 
actinide systems. 

Another way of taking into account the ’slowing down’ of an electron upon 
localization is to consider orbital-dependent functionals, Exc\{i’a}]-, where [xpa] 
is a set of one-electron orbitals with the orbital index a. These functionals 
are a possible evolution on the LSD and can be considered as a basis of the 
orbital-dependent DFT. They can be treated as the starting point for new ap- 
proximations to DFT. The self-interaction-corrected LSD (SIC-LSD) is one such 
approximation [15]. It is based upon the observation that localized orbitals in 
the LSD give rise to an error due to a spurious self-interaction contained in 
the effective one-electron potential. This error increases the more localized the 
orbital becomes. For a ’fast’ electron, however, the self-interaction is negligible 
or zero. The SIC-LSD energy functional is constructed from the LSD functional 
by explicit subtraction of the self-interaction term from the term. As we 

will demonstrate in this paper the SIC-LSD functional allows for an improved 
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treatment of the electron correlations, and is more adequate than the LSD for 
systems where both localized and delocalized electrons arc present. 

Solids where both localized and delocalized electrons are present give rise to 
some of the most facinating phenomema in solid state physics. They include the 
heavy fermion compounds and their complicated phase diagrams at low tem- 
peratures, the still unexplained phenomenon of the high Tc superconductivity, 
the observation of collossal-magneto resistance, with its potentially wide ran- 
ging technological applications. The theoretical investigations have usually been 
confined to either studies of generic model Hamiltonians which allow for a very 
accurate treatment of the electron-electron interactions, or materials specific 
LSD calculations with a not fully satisfactory treatment of the ’static’ electron- 
electron correlations. 

As already mentioned, the basic assumption in the SIC-LSD is that ‘loca- 
lized’ electron states experience a different potential from that of the normal 
delocalized valence electrons. One may argue for such a differentiation by thin- 
king in terms of Wigner delay times of electrons on a particular atom [16]. If an 
electron resides on a given atom for a long time the local electronic structure 
of the atom accomodates to the presence of the added electron, while a fast 
electron has no influence on the effective structure of that atom. This picture 
is implemented by assuming a fast (‘delocalized’) electron to experience an ef- 
fective potential as given by the LSD approximation, while a slow (‘localized’) 
electron experiences the LSD potential corrected for the self-interaction of the 
electron in question. Taking the dP configuration of a Ni ion in NiO as an ex- 
ample, one of the 8 d-electrons residing on a particular atomic site experiences 
Coulomb interactions with 7 other d-electrons. On the other hand, a conduction 
electron injected into a perfect NiO crystal will be made out of excited states, 
which ride on top of an array of Ni d® ions, and therefore Coulomb interacts 
with all the 8 d-electrons. That the LSD is adequate for describing the ‘fast’ 
electron is validated by the great success of this approach in describing the bon- 
ding properties of conventional metals. This picture is still very idealized, since 
the transition from localized to delocalized must be a gradual one. The added 
electron in NiO still may show some localized characteristics, and the ground 
state electrons do sometimes show k-dependent dispersions. However, one may 
be able to distinguish phases of solids being either predominantly ‘localized’ or 
predominantly ‘delocalized’, which will be the subject of the present paper. 

In Section II the general formalism of the SIC-LSD approximation will be 
discussed, and in Sections HI and IV two implementations of the approach into 
ab-initio linear-muflin-tin orbitals (LMTO)[17] band structure codes will be ou- 
tlined. Section V is devoted to a relativistic extension of the formalism, while in 
Section VI we concentrate on applications. Section VII concludes the paper. 



2 The SIC Formalism 

In the LSD all electrons feel the same mean- field single particle potential, 
and the same magnetic field, Bg//-, which depends on the total charge density, n, 




290 



W.M. Temmerman et al. 



and magnetization density, m. The LSD energy functional, has the form: 



with 



= Skin + U[n] + 
+ [ dr m(r) • Bext(r) 



dr V;xt(r)n(r) + 



i'kin 


= i'^c,\f\lpa) 

a 


U[n] 


1—1 1 


Vnir) 


J |r-r'| 


Kxt(r) 


T + T — r 

T,r ' ' 


E^f^[n,m] 


= J dr n{r) s^c{n{r) 



( 1 ) 



Here, T is a kinetic energy operator, and the ipaS are the one-electron wave- 
functions which in the nonrelativistic as well as the scalar relativistic case are 
two-component spinors. Bext(r) is the external magnetic field, Zt is the atomic 
number of site t, T is a lattice translation vector of the Bravais lattice, and r is 
the postion vector of site t in the unit cell, n and m may be expressed in terms 
of the ipa’s in the usual way 



n(r) = ^n„(r) = ^ i;*^{r)4>^{r) 

a a 

m(r) = ^m„(r) = ^ V’aW'T'V’aW, (2) 



with cr being the spin operator. 

Minimizing the above energy functional by taking the functional derivative 
of E^^^ with respect to ?/:*, leads to the single particle wave equation of the 
form 

= (T + V^^^ir) + a ■ BLSD(r) ) ^„(r) = V’aW, (3) 

with 

= vnir) + Vt^^ir) 

^LSD. N ^ SE^^^[n,m] 

"" ^ ’ 6n(r) 

BLSD(r) = Bext(r)+BLfD(r) 

tjLSd.x ^ SE^^^[n,m] 

"" ^ ^ dm(r) 



(4) 
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As discussed in the introduction the LSD successfully describes many so- 
lid state properties, but suffers from a deficiency caused by the spurious self- 
interaction (SI). Namely, such contributions to as U[ri], and 

contain spurious self-interactions of the single particle charges, n^, and magnetic 
moments, iiIq,, which do not cancel. The exact SDFT energy functional does not 
contain any self-interaction, and also in some existing approximations, e.g., the 
Hartree-Fock theory, the SI terms cancel out. Of course, due to its great merits, it 
does not seem justified to discard the LSD altogether in cases where this inherent 
self-interaction matters. Instead, in such cases, it is sufficient to augment LSD 
with terms removing this deficiency. The resulting approach, the self-interaction 
corrected LSD (SIC-LSD) formalism, [15] is defined by the following functional 



with 





^SIC-LSD _ ^LSD ^ ^SIC 


^SIC 


= - '^ea[na,nia] = u[n^ 




Q a 


U[na] 


= ^ / dr ?^a(r)E/ya(r) 


Hi, a 


— / dr' 

/ ^ |r -r'l ■ 


E^c^[na,ma] 


= j drna{r)exc{na{r),ma{r)) 



(5) 



The corresponding single-particle wave equation, obtained by taking the func- 
tional derivative of £;SIC-lsd respect to ■!/)*, reads 

-f ) 7/;„(r) = ^ A„,a' f/'a' (r), (6) 

a' 



with given in Eq. (3) and 

-( F„,„(r) + ) 

^E^c^[na,mg] 

Snc{r) 
dm„(r) 

Note that the problem of finding the single-particle states is now complicated 
by the fact that each state sees a different potential, Instead of determi- 

ning the energy eigenvalues, it is therefore necessary to evaluate the Lagrange 
multipliers matrix. A, to ensure orthonorniality within the set of states ipa- Fur- 
thermore, we should remark that in contrast to £;SIC-lsd jg invariant 

with respect to unitary transformations in the space of the states Suppose 



„,SIC 



„LSD 



(r) 



,SIC 



(r) 




292 



W.M. Temmerman et al. 



we perforin a unitary transformation among the occupied orbitals specified by 
the matrix [7 = 1 + i{dS), with {dS)ai,a 2 = {dS)a 2 ,ai = £*• Then the states 
tpai, and change to first order in e by is-ijja 2 and respectively. Using 

the definition of (Eq. (6)), we find the first order change of £1®^® to be: 

d£lSic = is + C.C.. If we intend to study the ground 

state properties we should therefore choose the states ipa so that they minimize 
£;Sic respect to all unitary transformations, that is to say, they fulfil the 
so-called localization criterion 

~wlf\i’a2) = 0. (7) 

It appears that the SIC-potentials are significant for those states whose 
charges are localized in space. Thus the importance of SI corrections for atoms, 
molecules and core electrons in solids seems obvious. However, our main interest 
concentrates on the valence electrons in periodic solids. If extended Bloch states 
are used to describe these electrons, the corresponding ui®^*®’s turn out to be 
negligible. If, on the other hand, spatially localized states are used the rc®^*® may 
be an attractive potential contribution of considerable size. In the remainder of 
this and the following section we will describe one method of solving the SIC- 
LSD energy minimization problem of Eq. (6), while section IV describes another 
yet equivalent procedure. 

In what follows we assume that the electronic system in question may be 
described by Wannier states each centered around some atom t in some unit 
cell, displaced from the central unit cell by the lattice translation vector T. For 
some of these states, defined by 4)a with a = (m,T), the Sl-corrections are 
assumed to be negligible, while for the others, defined by 'ipp with f3 = (n, T), 
the Sl-corrections have to be taken into account explicitly. Indices n and m 
thus enumerates the Wannier states within the unit cell. For a periodic solid we 
assume a periodic repetition of localized states, i. c., 



V’n,T(r) = </'n,T=o(r “ T) = tpn{r - T). 


(8) 


= <?!>m.T=o(r - T) = <pm{r - T). 


(9) 


The same symmetry applies to the SIC-potential of state ipn,T- 




<?(r) = - T) = wf^{v - T), 


(10) 


and the Lagrange multipliers matrix as 




An,T;n',T' = — T'). 


(11) 


Considering the states and 4>m as the elements of the column vector and 

their complex conjugates as the elements of the row vector , Eq. (6) may be 
rewritten in matrix form as 


(^L®D(r) ^ ^ ^ ^A(T-T')^( 

T' 


r-T'). (12) 
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Here, 14 /SIC diagonal matrices, and the subblock of acting on 

the states cj)m is zero. Since the A’s arc chosen to make the states <P orthonornial, 
the Lagrange multipliers may be expressed as the following matrix elements of 
the SIC-Hamiltonian 

A(T - T') = ^ f dr<P^r + T - T' + Ti)(iL^®°(r) 

J Qu-nit 

+tySiC(j.^Ti))^>(r + Ti). (13) 

When deriving Eq. (13) wc have made use of Eqs. (8) to (10). To construct the 
Bloch state vector, we form the following lattice sum 

^"k(r) = exp(— ikT) tP(r + T), (14) 

T 

while the inverse transformation is 

(?(r -I- T) = — fdkM(k) exp(ikT) !Z'k(r)- (15) 

Here, the k-integration is over the Brillouin zone (BZ) whose volume is (Ibz- 
The unitary matrix M will be elaborated upon at a later stage. By acting from 
the left with the operator 



A = ^M^i(k)exp(-ikT), 

T 

on the wave equation for ^(r -|- T), Eq. (12), one obtains the following wave 
equation for the Bloch state vector tf'k(r) with the wave vector k: 

( + V^^^(r) ) tf'k(r) = Ak <^'k(r). (16) 

Here is a diagonal matrix whose elements, , arc given by 

= ^^(M“^(k)),,,„exp(ikT)u;®^^(r + T) 

n T 

^u(r + T), (17) 

where the subscripts n and ly refer to bands. The Lagrange multipliers matrix, 
Ak, is defined as 



Ak = ^exp(ikT) M“i(k) A(T) M(k). (18) 

T 

Note that has lattice translational symmetry: + T) = V|f^^(r), due 

to the symmetry of the Bloch states: l^k,i/(r + T) = cxp(ikT)!?'k,iy(r)- In practical 
applications is chosen to be nonzero only for the bands which are expected 
to be extremely narrow, i.e. having well localized Wannier states. We shall call 
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the states corresponding to such bands the SI corrected Bloch states, and label 
them by c, while the rest of states shall be referred to as the non-SI corrected 
Bloch states, and labelled by nc. 

It is straightforward to write in the usual way in terms of the Bloch 

states, (I'k, while is expressed through the Wannier states — T) as: 

^ _ ^en[nn,m^]. (19) 

n,T 

Due to the translational symmetry of the localized states, and making use of 
Eqs. (8) to (10), the localization criterion, Eq. (7), takes the following form 

^ f dr^;^(r + T) (tc„,(r + T)-zu„,(r + T + Ti-T2)) 

rji ’J ^uriLL 

(j* + T + Ti — T 2 ) = 0. (20) 

This relation is valid for any pair of states V-'ni and V-'ri 2 > and arbitrary translation 
lattice vectors Ti, T 2 . Here the integration is over the volume of the central unit 
cell. By inserting this relation into Eq. (13) it is easy to see that the subblock 
of the Lagrange multipliers matrix A, built by the SIC-corrected localized states 
is hermitian 



A„n„2(Ti - T 2 ) — A*2,„i(T 2 - Ti). (21) 

Finally, it remains to define the matrix M(k) introduced in Eq. (14). The 
requirement is that this matrix, via Eq. (15), generates well localized states, 
ipn, that satisfy the wave equation (12), and for which the SI corrections are 
significant. The following expression turned out to meet such requirements 

M(k) = U (22) 



with 



= N,{^) [ drlFi(,,(r)iZ'k=o..(r) (23) 

J n 



and 



^.(k) = (El / (24) 

,2' 

Here the integration is over the volume 17, obtained by applying all symmetry 
operations of the crystalline point group to the unit cell. The matrix U is deter- 
mined numerically so that the localized states are orthonormal and fulhl the 
localization criterion (Eq. (20)). 
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3 The Unified Hamiltonian Approach 



In practical applications we have to iterate Eqs. (4), (6) and (16) to (18) to 
self-consistency. In particular, the Lagrange multipliers matrix, Ak, has to be 
evaluated to ensure the orthonormality of the Bloch states, iJ'k- An alternative 
method, developed in relation to the Hartree-Fock theory, consists in solving the 
eigenvalue problem for a hermitian operator, the so-called unified Hamil- 

tonian 

huM ^ ■ (^^) 

To define huM appropriate for the SIC-LSD method[18] we introduce the projec- 
tion operators, Pk, onto the subspaces of the SI corrected Bloch states, ff'k.c, and 
the projection operators, Qk, onto the subspace of the non-SI corrected Bloch 
states, Pk.nc, namely 



Tk = ^Pk.c = ^I'Z'kx) (tf'k.cl 

C C 

Qk. ^ ^ ^ Qk,nc ^ ^ ^ |l^k, nc) (^^k,nc|* 



(26) 



Subsequently, hu,k is dehned as 

hu,k = Pk,c' + ^ Pk.c t^kx Pkc 



c,c';(c7^c') 



+ Qk ' 



,SIC 



Pk,< 






SIC\ 



C ^k,( 



Qk 



(27) 



or alternatively as 



^u,k 



= h 



LSD 



Y1 ^kx ^kx 



- ^ Pkx(^"^®°+^kx^)^kx 

c,c' \(c^c') 

+ Qk. 



(28) 



To fulfil the hermiticity of hu.k, the transformation matrix, M, and the localized 
states, ?/>„, have to be chosen so that the following quantity, named A^, is real 



T,n 

V'„(r -b T). 



dr !Fk_c(r) exp(-ikT)ui„(r + T) 



(29) 

To show that Eqs. (25) and (16) are equivalent let the Hamiltonian huk, as 
dehned by Eq. (28), act on the SI corrected Bloch state, l^kx- This gives 

+ Uk^f^) llf^kx) = £kx |!^kx) 

+ ^ (#'kx'l^'^®°+<fl'^kx) l-^kx'). 

c';(cVc) 



(30) 
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Similarly, for the non-SI corrected state !f"k,nc we get 

h \^U.,nc) — ^k,nc \^k.,nc) 

-^(il^k.c|<fVk.„c> (31) 

C 

When interpreting the matrix elements on the right-hand side of Eqs. (30) and 
(31) as the elements of the Lagrange multipliers matrix, Ak, then Eqs. (30) and 
(31) are, indeed, equivalent to Eq. (16). The eigenvalues £k,^ are the diagonal 
elements of A. 

The transformation from real space to k-space, as dehned by Eq. (14) and 
applied to Eq. (7), leads to the following localization criterion for the Bloch 
Stat0S 



f «if(r) - «E!S(r))$^k,..(r) = 0. (32) 

Inspecting the r.h.s. of Eq. (30), containing the Lagrange multipliers of the sub- 
space of the SI corrected Bloch states, one can see that the above equation 
guarantees the hermiticity of the corresponding subblock of Ak. Summarizing, 
the unihed Hamiltonian approach is seen to yield both the orthogonalized SI 
corrected and non-SI corrected Bloch states, as well as the Lagrange multipliers 
matrix, all just from one matrix diagonalization per k-point. 

The unified Hamiltonian scheme may readily be incorporated into the linear 
mufSn-tin orbital (LMTO) band structure method, [17, 35] and in what follows 
we shall describe the necessary steps in some detail. 

The lattice Fourier transforms of the muffin tin orbitals (MTO’s), Xo-./,k(r), 
are used as a set of one-particle basis states. The label / stands for the angular 
momentum quantum numbers Z,m and the site r corresponding to the head of 
the MTO, while a denotes the spin component. The space coordinate is decom- 
posed as r = p -|- T, with p restricted to the central unit cell. The Bloch states 
have the form 

>^'<T,k,^(p,r) = ^<^d.k(P>'r)- (33) 

/ 

The wave vector coefficients, and the one particle energies, £o-,k,jy, are 

obtained by solving the following eigenvalue problem 

y ( (Xer,/,k|^Mk|Xcr,/',k)) -^erj'-k ~ 

I' 

^(Xcr,/,k|Xa,/Mc) 

I' 






(34) 
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Making use of Eq. (27) leads to the following form of the matrix elements of the 
unified Hamiltonian 

(Xt7,/,k|httk|Xcr,/',k) = (XtT,/,k|^' |Xcr,/gk) 

c,c',C7^c' i' 

AT,. ) {1'<r,Kc' \Xa,P,A 

+ ^(X<7,/,k|'?'cr,k,c)('?'<T,k,c|'fk?f |!^rT,k,c)(3'(T,k.c|X<7,/',k) 
c 

X( ^^(X(T,/,k |i^(T,k,nc) (l^cT,k,nc |t^k,c l^cr,k,c) 

nc,c 

A'a,k,c\Xa,I',A + C.C. ). (35) 

In the second term on the right-hand side of the above equation, we have inserted 
a complete set of the LSD Bloch states, with the corresponding eigen- 
values Next, instead of using the MTO’s, we can express all quantities 

in terms of the normalized single-site solutions, cj), and their energy derivatives, 
(j), at fixed energies and in the spherical potential and the magne- 
tic field In the representation of these states ^gj^i{p) = <pcr,i{p), and 

^<t,/. 2 (p) = (t>a-,i{p), the Bloch states read as 

{P,r) = Yl “a (p) ■ (36) 

The relation between the coefficients aT on the one hand and the coefficients 

on the other, is provided by the standard LMTO formalism. [35] Similarly, 
the MTO’s may be written in the form 

X(T,/.k(P)'^ ) = ^ ^ (p) ■ (3’i') 

I',i 

Inserting Eq. (36) into Eq. (15) we obtain the corresponding representation of 
the localized states, r/jn 

V>„(r + T) = Mp,r-, T) = ^ cTlr,T^’a,I,^{p) (38) 

l,m,i 

with 

C'ScT = J dkMn,u{k)exp{ikT)aTi.,.,k- 
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Making use of Eqs. (36) to (38) and (17), we can calculate the matrix elements 
of the unified Hamiltonian, with 

(Xcr,/,k|^(T,k,i/) — ^ ^ ^(7,1 ,1' 9l',i 

I',i 

= ^M-„i(k)exp(-ikT) 

n,T 

X I ,,,,(/□). (39) 

Here gi^i = and we have approximated the SIC potential by its 

spherical average, over the ASA sphere at site r in the cell T. Then 

it is also straightforward to express the energy functionals and in 

terms of quantities defined above. 

As can be seen from Eq. (35), the SIC-LSD implementation is much more 
complicated than the LSD, because the Hamiltonian matrix elements depend 
explicitly on the Bloch states Consequently, one has to iterate the appro- 

priate set of equations until the self-consistency for the one-particle potential, 
yLSD, achieved, and in addition, the unified Hamiltonian becomes consistent 
with its solutions. In practice, it turns out, that the self-consistency with res- 
pect to all necessary quantities, can be reached within a moderate number of 
iterations, if the Hamiltonian, hu\ for the iteration is evaluated using the 
Bloch states of the previous iteration. In contrast, the transformation matrix, 
M, the localized states, ■i/'n, and the SIC potentials, are derived from the 

(i) 

eigenstates of hu . In the first iteration one starts with LSD Bloch states. The 
LSD potential, is obtained by mixing the total charge densities of previous 

iterations in the usual way. In practice, the evolution of the total energy, £ltot, is 
a suitable measure of the progress made in approaching overall self-consistency: 
If E-tot is converged to within 10“^ — 10“® Ry, then also all other quantities turn 
out to be sufficiently well converged. 

In contrast to the LSD, the SIC-LSD formalism gives a great deal of variatio- 
nal freedom to the DFT. At the beginning of each calculation one has to decide 
on the number and the character of the localized states -!/>„, e.g., their orbital 
moment- and spin-quantum numbers, and the sites to which they belong. This 
choice depends on the nature of the system in question and is guided by physical 
intuition. In general, it is not unique. We have the freedom to try different, but 
reasonable, ’configurations’. After carrying each of them to self-consistency the 
resulting total energies, Etot, are compared, and the configuration corresponding 
to the lowest total energy is considered to provide the best description of the 
physical situation, as described within the SIC-LSD scheme. Of course, there al- 
ways exists a possibilty that the LSD will provide the minimum of Etot, meaning 
that the SI corrections are of no importance for the system in question. 
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4 The Steepest Descent Approach 

An alternative way to solve the SIC-LSD equations, Eq. (6), is by the steepest 
descent method, [20] whereby the ^sic-lsd fuj^ctional is minimized iteratively. 
If at some point approximative solutions 'ipa are given, the energy may be furt- 
her minimized by adding a correction proportional to the gradient of £jSIC-lsd^ 
Specifically, 

^ tfa + S'lpcn (40) 



with 



= -X Q 



j^SIC- 



LSD 



where Q projects onto the space orthogonal to the occupied states: 



0 = 1 - XI 

a 



(41) 



(42) 



The gradient in Eq. (41) is given by the SIC Hamiltonian in Eq. (6), so that 



Sipa = 



-b W®^^)V(a - X 



(43) 



L 0 J 

The localization criterion, Eq. (7), may be fulfilled by adding to ■0a a correction 
term given by 



50a = -y'^ra,fS 0/3 


(44) 


0 






(45) 



while a third type of correction term is needed to keep the orbitals xpa orthonor- 
mal: 

13 

with 

ta,l3 = (0a 10/3) ~ (^a,/3- (47) 



The steps (43), (44) and (46) may be interchanged and the parameters x and 
y chosen according to the problem at hand. At convergence all steps vanish, 
and it is seen that a vanishing step in Eq. (43) corresponds to Eq. (6), a vanis- 
hing step (44) corresponds to Eq. (7), and a vanishing step (46) corresponds to 
orthonormality, ta,p = 0. 
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When the one-electron wavefunctions are expanded in a basis set, Xij as 

i 

where i is a composite index labelling the degrees of freedom of the basis, the 
steps (43), (44) and (46) are turned into matrix operations on the vector a of 
expansion coefficients. In our actual implementation, the Xi’s are conveniently 
chosen as TB-LMTO functions. [21] in which case i labels atomic sites and angu- 
lar momentum and spin quantum numbers, as discussed in connection with Eq. 
(33). The steps then read 

(ia“ = -o:(0-i -77) ■ (H + V“) -a“, (49) 

5a“ = -y'^ra,f3a>^ , (50) 

0 

(ia“ = ^ (0-1 - 77) • O -a“, (51) 

respectively. Here, O, H and V“ are the overlap, the Hamiltonian, and the 
SIC-potential matrices: 

= iXilXj), (52) 

H., = iXi\h^^^\Xj), (53) 

V“ = iXilwf^lXj)- (54) 

and 77 is given by 

occ. 

nij = ■ af* af , (55) 

0 

where the ^-function ensures that only projection on states with the same spin as 
ipa is considered. In a periodic lattice, O, H, and 77 arc translational invariant. 
For 77 this follows from the translational invariance (8), which for the expansion 
coefficients means that (setting a = (n, T), with T a lattice translation): 

n,T n,0 /rr‘\ 

“(T-f-R)i ^ “(T+R-T)L • 

Here 7 enumerates the angular momentum characteristics of the TB-LMTO 
function. Only the SIC potential matrix V“ is not translational invariant, and 
requires special attention. As in the unified Hamiltonian approach the transla- 
tional invariance is exploited by switching to k-space: We assume R running 
over a finite cluster (A7 unit cells with periodic boundary clusters, where M 
must be so large as to ensure that the SIC states vanish outside this region). 
Let k enumerate the corresponding reciprocal space, i. e., the M x M matrix 
UkR = ^ unitary: UU'*’ = U+U = 1. Then Eqs. (49)-(51) hold for the 

Fourier transformed quantities as well, except for the product V“a“, which must 
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first be evaluated in real space and Fourier transformed afterwards. Specifically, 
let 



^{T+R)L ~ (X(T+R)L|^n,¥!V'n,T) ■ (57) 

Then, with 

= (58) 

we get 

5o"(k) = -X (O-i(k) - 7I(k)) • [H(k) ■ o"(k) + 5”(k)] , (59) 

5a”(k) = -y ^r„,„/(k) a” (k) , (60) 

n' 

and 

Sa‘'{k) = ^{0-^ -n) 0-a‘' (61) 

for the gradient-step, the unitarian-mixing step and the orthonormalizing step, 
respectively. The r-inatrix in k-space in (60) is simply given by: 

r„,„-(k) = ^K'£(k) . 5^Jk) - 6^'£(k) . a” Jk)] . (62) 

TL 



5 The Relativistic Extension 

For many applications it is desirable to take account of all relativistic effects, 
including the spin-orbit coupling. This is not only important when we are con- 
cerned with systems containing heavy atoms, but also if we, e.g., intend to get 
reasonable results for properties depending on orbital moments and their cou- 
pling to the spins of the electrons. Furthermore, the localized states resulting 
from the SI corrections are especially sensitive to the relativistic effects. There- 
fore, we generalize the SIC-LSD formalism, described in the previous section, to 
a fully relativistic spin-polarized case. The steps involved are in close analogy to 
the derivation of the relativistic LSD-LMTO method by Ebert [22]. 

The LSD wave equation of the scalar relativistic theory, where the two spin 
channels, as well as the orbital- and spin-moments are decoupled, is now replaced 
by the Dirac equation. [23] The relativistic generalization of Eq. (3) reads as 

^LSD-REL ^ (yREL ^LSD ^ , B^®°(r))V>a(r) = £at/'a(r) (63) 



with 



= a ^V+ i(/I-/). 



(64) 
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Here a, (3 are the Dirac matrices, while I is the unit matrix. The wave functions 
spccihcally their kets, are now four-component column vectors, wlieras their 
bras arc the row- vectors with complex conjugate elements. 

The equations (1) to (32) remain valid in the relativistic case provided one repla- 
ces the operators cr and T with the matrices j3<j and respectively [23]. The 

other operators should be interpreted as diagonal matrices. The main complica- 
tion arises from the nature of the single-site wave functions of the Hamiltonian 
^LSD-REL xhey are to a good approximation simultaneously the eigenfunctions 
of the orbital moment, I, and the z-component, jx, of the total angular momen- 
tum, j . In the case of | /i | < / -I- 1 /2 there exist two independent solutions for these 
quantum numbers {is = 1,2), and each solution is the sum of two components 
(ic=l,2), having total angular momentum jic=i = 1 + 1/2 and jic =2 = I ~ 1/2, 
respectively. In the case of \fi\ = I + 1/2, on the other hand, there is only one 
solution which in addition is an eigenfunction of the total angular momentum 
j = 1 + 1/2. The relativistic generalization of the single-site wave functions, 
of the scalar relativistic theory, as introduced in Eq. (36), become 

+ (1 - Sm, 1+1/2) <!>\%sAp) ( 65 ) 



with 



( 66 ) 

Here R\ ^ is the radial part of the ic component of the single-site solution 
is, and the index / comprises the indices as well as, the site-index r. The 
K = 1 corresponds to the large component and «; = 2 to the small component 
of the bispinor 'Pi+c,is,i- Then Uc^k, takes the values l\+ = ^ 2,2 = ^1 h ,2 = ^ + 1, 
and I 2 + = 1 — 1. The spinor functions are combinations of the spherical 

harmonics, the spin functions, Xsj tlie Clebsch-Gordon-coefhcients of 

the well known form 



Yy^,i(p) = ^ < j,ix\l, /I - s, 1/2, s)Yi^^_s{p)Xs- 

S 

The matching of these single-site functions to the interstitial LMTO envelope 
functions, and the construction of the relativistic MTO’s has been described by 
Ebert [22] and will not be repeated here. With the information given above, 
it is easy to see that Eqs. (33) to (39) apply to the relativistic case as well, 
provided the subscript a is omitted, and the index I{l,m,T) gets replaced by 
the indices {I{1, /i,T),is). Furthermore, an evaluation of the occuring quantities 
implies matrix- and vector-operations in spinor space. Since the dimension of the 
eigenvalue problem is doubled in comparison to the scalar relativistic case, it is 
obvious that fully relativistic calculations will be considerably more expensive. In 
addition, the computation of the matrix elements, especially those introduced 
by the SIC terms, is rather tedious and requires careful programming. As an 
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illustration, we conclude this section by displaying the relativistic version of Eq. 
(39), namely 



(k) exp(-ikT) 






hi, is, is' 



(67) 



with 



ic 

41*y..'(p)+ / P^dp 6f^’“"(p,r;T)i?(^^_,(p) 

ic,ic' 

R%,is'Ap) -J2AjicJic,P,l2,ic) I p2dp&f^’“"(p,r;T)i?® ,, ,(p) 

ic 

RfLs'Ap)) 



( 68 ) 



and 

Q{ji,j 2 ,P,l) = ^ 2s < jip|i,p-s,l/2,s) 

s=(-l/2,l/2) 

(j 2 M|^,M - S, l/2,s). (69) 



6 Applications 

In this section we concentrate on results obtained within SIC-LSD for NiO, 
cerium metal and cerium monopnictides. For all these systems LSD fails in de- 
scribing the correct physics. In NiO, as mentioned earlier, LSD underestimates 
the magnetic moment and leads to a vanishing band gap due to an inadequate 
treatment of the on-site Coulomb repulsion among 3d electrons. In solids contai- 
ning cerium it is the / electrons that are not correctly represented within LSD, 
and the attention here is turned to a variety of structural and magnetic phase 
transitions, beyond the reach of LSD. 

6.1 NiO 

The calculated key parameters of NiO as given by LSD, LDA+U, SIC-LSD and 
Hartree-Fock theory are quoted in Table 1 and displayed in Fig. 1. Applying 
SIC-LSD to NiO is seen to lead to a substantial band gap of 3.15 eV, at the 
experimental volume, which compares reasonably well with the experimental 
value of about 4.2 eV [27]. Also the magnetic moment is improved with respect 
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Fig. 1. The band gap, magnetic moment and the total energy of NiO in the antiferro- 
magnetic unit cell as a function of the lattice constant, as calculated by LDA-I-U and 
SIC-LSD. 



to the LSD result, and the lattice parameter is nearly spot on. The experimental 
values of the magnetic moment fall between 1.66 and 1.90 /is [8]. That SIC-LSD 
provides better treatment for the 3d electrons in this system is also supported 
by comparison with the rcsrdts of LDA+U calculations [24], performed within 
the spin polarised generalisation of the method. The latter is yet another scheme 
that takes the on-site Coulomb repulsion, U, explicitly into account, however, U 
is often treated as a parameter and chosen such that some quantity agrees with 
its experimental value. In Fig. 1 we show the results of both schemes for the 
band gap, spin magnetic moment, and total energies, as functions of the lattice 
constant. The numerical values calculated by LDA-I-U and SIC-LSD agree well 
both between themselves and with the experimental data. It is interesting to 
note that according to both the LDA-I-U and SIC-LSD calculations the band gap 
in NiO increases with increasing pressure. The band gap was found to behave 
similarly in LSD. The value of the band gap is sensitive to the degree of charge 
transfer from Ni to O. If this charge transfer increases, the band gap decreases, 
because the unoccupied Ni d states move down in energy, while the occupied 
O p states move up in energy. We conclude that the SIC is the mechanism for 
producing a sizeable band gap in NiO, but the actual value of the gap reflects 
the charge transfer in the system. 

In Table 1 we compare the SIC-LSD results for the band gap, lattice pa- 
rameter and bulk modulus with the experimental results and those from other 
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Table 1. Parameters characterizing the electronic structure of crystalline NiO and its 
structural stability, as calculated by LSD, LDA-I-U (with a 0=6.2 eV) and SIC-LSD 
[24]. The Hartree-Fock ( H-F ) values and experimental data for NiO were taken from 
Towler et al. [25]. 



Quantity/Method 


LSD 


LDA+U SIC-LSD H-F 


Experiment 


lattice constant (rA) 


4.08 


4.19 


4.18 


4.26 


4.17 


band gap (eV) 


0.5 


3.0 


3.15 


14.2 


4.2 


B = (Cii + 2 Ci 2)/3 (GPa) 230, 236“ 


182 


220 


214 


145, 205, 189 



a : Ref. [26] . 




Fig. 2. Comparison of the experimental electron energy loss oxygen K edge spectrum of 
NiO with the density of empty oxygen 2p states of NiO, as calculated by LSD (0 = 0), 
LDA+U {U = 6.2 and 8 eV), and SIC-LSD. 



calculations within LDA+U, LSD and Hartrcc-Fock method. Note that, unlike 
in case of the SIC-LSD, the results for LSD and LDA+U refer to the full poten- 
tial implementation of the LMTO method. The effect of the full potential can be 
seen in the LDA+U total energy curve in Fig. 1, at higher values of the lattice 
parameter, as compared with the one due to SIC-LSD. This, however, has not 
affected the minima of the respective curves, as can be seen from Table 1. 

A further demonstration of the advantages of SIC-LSD over LSD is presen- 
ted in Fig. 2, where we compare the K edge EELS spectra of NiO with the 
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V (a.u.)^ 



Fig. 3. Cohesive energy of Ce (in mRy/atom) as a function of atomic volume (in 
ao/atom). The curve marked ’SIC-LSD’ corresponds to the calculation with one loca- 
lized /—electron per Ce atom, while the curve marked ’LSD’ corresponds to itinerant 
/—electrons. The common tangent marks the phase transition. 



calculations performed within LSD, LDA-I-U and SIC-LSD. The structure of the 
experimental EELS spectrum shown in Fig. 2 is dominated by the the dipole 
{filled ls}-H>{cmpty 2p} transitions. Figure 2 shows that the unoccupied O 2p 
density of states (DOS) calculated using LSD does not agree well with the expe- 
rimental EELS spectrum. The peaks in the EELS spectrum are associated with 
the hybridization between oxygen 2p and nickel 3d, 4s and 4p states, respectively, 
and our analysis shows that the latter two peaks are practically unaffected by 
Hubbard correlations in the 3d shell. The separation between the two main pe- 
aks, namely, the O 2p-Ni 3d and O 2p-Ni 4p peaks, as seen in the LSD DOS 
curve, is approximately 2 eV larger than the separation between the same peaks 
in the experimental spectrum, and the spectral weight of the low-energy peak 
in the DOS calculated using LSD is far too high. Performing the LDA+U eal- 
eulation for the O 2p DOS, with U=6.2 eV, leads to a significant improvement 
over LSD. Considering SIC-LSD result, that unlike LDA+U does not contain 
any adjustable parameters, the agreement with experiment is rather good. The 
SIC-LSD O 2p DOS shows slightly reduced separation between the two main 
peaks in the spectrTim, and thus improves the agreement with the experimental 
EELS spectra, as compared to LSD. 

6.2 Cerium 

As mentioned in the introduction, LSD was not able to explain the 7 — ta phase 
transition in cerium although it gave a magnetic solution for the lattice parame- 
ter close to the experimental value of 7 — Ce. The SIC-LSD, however, provides 
a unified description of this transition with the total quenching of the magnetic 
moment and volume collapse of 24 %. The calculated transition pressure of — 1 





Self-Interaction 



307 




Fig. 4. Spin (s) (in hb) and orbital (o) moments as a function of volume (in (a.u.)®) 
within the relativistic SIC-LSD approximation in 7 — Ce. 



kbar compared favourably with the value of —7 kbar, extrapolated from the ex- 
perimental phase diagram to T=0K, at which the calculations were performed. 
The volume collapse associated with the transition is shown in Fig. 3, where the 
calculated total energy of Ce as a function of volume is shown [28-30]. The two 
different curves correspond, respectively, to the a— phase (curve marked LSD; 
SIC-LSD reduces to the LSD for delocalized electrons) and 7 — phase (curve 
marked SIC-LSD). The LSD minimum of the total energy, corresponding to the 
a— phase, is located in the non-magnetic region at V = 168 (a.u.)^, while the 
spin magnetic moment is m = 1.32ps at the SIC-LSD total energy minimum, 
corresponding to the 7 — phase. Within these ab initio calculations, the 7 — s-a 
transition can be viewed as the transition betweeir the phase with fully localized 
/ electrons and the phase, where the / electrons are fully delocalized. 

Performing the fully relativistic SIC-LSD calculations allows one to study 
also the orbital moment of the 7 -phase of Ce [31]. In Fig. 4 the orbital moment 
can be seen to extrapolate to the atomic value of 3 at large volumes. This is the 
localization of one / electron within the relativistic SIC-LSD theory that suffices 
to describe the substantial spin and orbital moments in 7 — Ce. The relativistic 
SIC-LSD scheme allows for each of the 14 possible /—states in Ce to beeome 
localized. As seen in Table 2, one finds that these 14 possible solutions have 
quite different orbital momeirts and correspond to different total energies. The 
14 solutions have the block structure of 2+6+6. The ground state solution has 
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the characteristics of a good crystal field-like state: sizeable spin and orbital 
inomcnts which arc anti-parallel aligned. The other solutions are substantially 
different from what one would expect from crystal field considerations. We find 
6 solutions with energies 9.4 mRy higher than the lowest energy solution. These 
solutions are characterized by an orbital moment which is over 4 times smaller 
than the one of the ground state, but a spin moment substantially unaltered. 
Finally we have 6 solutions, situated 6.3 to 7.9 mRy above the minimum, whose 
orbital moment is essentially zero and whose spin moment is smaller than that of 
the other solutions. The total energy differences in Table 2 could be interpreted 
as excitation energies to populate each of the 14 localized states, corresponding 
to different spatial symmetry. We note that the solutions fulfill time-reversal 
symmetry: 7 spin ups and 7 spin downs, nearly all pairwise equal. The small 
deviations from exact spin up and down symmetry should be considered as the 
measure of the accuracy of our calculations. In Table 2 we also present the 
contributions to the spin and orbital moments due to the SIC localized states 
only, and respectively. Regarding the total orbital moment, one can 

see that nearly the whole contribution comes from the localized state, whilst 
in case of the total spin moment 74% comes from the localized state, and the 
delocalized states contribute the remaining 26%. 



Table 2. Calculated total energy differences with respect to the ground state (in mRy), 
orbital and magnetic (in iib) moments of the 7— phase of Ce. Also quoted are the local 
orbital and magnetic moments of the localized state. 



M, 




L. 




AE 


1.310 


0.960 


-2.290 


-2.240 


0.0 


-1.309 


-0.966 


2.317 


2.305 


0.0 


1.259 


0.923 


-0.053 


0.034 


6.3 


1.275 


0.935 


-0.060 


0.028 


6.7 


-1.257 


-0.896 


0.054 


-0.032 


6.9 


-1.266 


-0.913 


0.021 


-0.131 


7.0 


1.271 


0.913 


-0.024 


0.128 


7.9 


-1.271 


-0.912 


0.032 


-0.120 


7.9 


1.293 


0.986 


-0.536 


-0.432 


9.4 


-1.293 


-0.985 


0.536 


0.432 


9.4 


1.293 


0.985 


-0.532 


-0.429 


9.4 


-1.293 


-0.985 


0.532 


0.429 


9.4 


1.293 


0.985 


-0.510 


-0.408 


9.4 


-1.293 


-0.985 


0.535 


0.432 


9.4 



The solution with the lowest total energy is consistent with all three Hund’s 
Rules, and the spin and orbital moments are anti-parallel aligned. This gives 
one confidence that the relativistic SIC-LSD scheme forms a bridge between 
the atomic and band pictures, and enables one to obtain a good description of 
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both the localized and itinerant properties of a rare earth metal. The SIC-LSD 
relativistic band theory preserves all characteristic features of the 7 — to; transition 
in Cc, and explains it as a transition from a localized state with maximum spin 
and orbital moments to a delocalized state without spin and orbital moments. 



Table 3. Calculated transition pressures for the electronic and structural phase tran- 
sitions in the cerium pnictides. Also quoted are the specific volumes on the two sides of 
the transition [32] . The notation (d) and (1) refers to calculations with delocalized or 
localized Ce /—electrons, i.e. tetravalent or trivalent Ce atoms. B2* denotes a slightly 
distorted B2 structure (see Ref. [39] for discussion). 



compound 


transition 


Pt 

theo. 


(kbar) Vj, 

expt. theo. 


(ao^) 

expt. 


V, (ao-^) 
theo expt. 


CeN 


Bl(d) ^ B2(d) 


620 


148 


- 


141 


- 


CeP 


Bl(l) ^ Bl(d) 


71 


90“,55*’ 325 


308“ 


297 


2 98“ 


CeP 


Bl(d) ^ B2(d) 


113 


150(40)“ 288 


285“ 


246 


247“ 


CeAs 


Bl(l) ^ B2(d) 


114 


140(20)“ 332 


315“ 


265 


274“ 


CeSb 


Bl(l) ^ B2*(l) 


70 


85(25)“* 400 


398“* 


353 


354“* 


CeSb 


B2*(l) ^ B2*(d) 


252 


311 


- 


295 


- 


CeBi 


Bl(l) ^ B2*(l) 


88 


90(40)“ 427 


399“ 


376 


360“ 


CeBi 


B2*(l) ^ B2*(d) 


370 


317 


- 


304 


- 



a: Ref. [33]. b: Ref. [34]. 
c: Ref. [35]. d: Ref. [36] . 
e: Ref. [37] . 



6.3 Cerium Monopnictides 

The cerium monopnictides, CeN, CeP, CeAs, CeSb and CeBi, undergo a variety 
of structural and magnetic phase transitions under pressure. They have been 
thoroughly studied experimentally, and here we discuss a comparison of those 
studies with the SIC-LSD calculations. A detailed summary of our SIC-LSD 
ealculations is given in Table 3, where we present the respective transition pres- 
sures and volumes associated with a variety of transitions in comparison with 
the experimental data. The SIC-LSD calculations involved the total energy cal- 
culations as a function of volume for CeN, CeP, CeAs, CeSb and CeBi for B1 and 
B2 structural phases, in the ferromagnetic (F) arrangement of Ce moments, and 
with the /—electron treated as either delocalized (LSD) or localized (SIC-LSD) 
[38,39,30]. 

As can be seen in Table 3, the calculations faithfully reproduce the pressure 
behaviour of the Ce pnictides. In particular, we find that only for CeN the / 
electron is delocalized at ambient conditions, while the other cerium pnictides 
are characterized by localized / electrons, in accordance with experimental ob- 
servation. For CeP two phase transitions are observed, first the delocalization 
transition and subsequently the structural transition from B1 to B2 structure. 
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The two transitions merge into a single one for CeAs, namely a transition from 
a B1 structure with the / electron localized to a B2 structure with the / elec- 
tron delocalized. For the two remaining systems, CcSb and CcBi, the structural 
transition occurs first and at higher pressures does the / electron become deloca- 
lized. This trend may be understood in terms of increasing / electron localization 
with increasing nuclear charge of the ligand. Moreover, for CeSb and CeBi the 
SIC-LSD predicts a second isostructural B2— >-B2 transition to occur at higher 
pressures, where the /—electrons are delocalizing. It seems necessary to perform 
measurements for these systems at pressures above 250 kbar to clarify whether 
such transitions occur in reality. 

The magnetic properties of cerium monopnictides, with the exception of CeN, 
are rather complicated with several antiferromagnetically ordered phases, inclu- 
ding a Devil’s staircase, and phase transitions as a function of pressure, tempera- 
ture and applied magnetic field [34,40-43]. Therefore, we have for CeP performed 
a series of calculations exploring the energetics of various conceivable magnetic 
orderings. Apart from the ferromagnetic structure, LSD (delocalized / electrons) 
and SIC-LSD (localized / electrons) calculations were done in the AFl and AF2 
structures. In the AFl structure the cerium moments are ferromagnetically or- 
dered within (100) planes, which then are antiferromagnetically stacked in the 
(100) direction. Similarly, in the AF2 structure the cerium moments are ferroma- 
gnetically ordered within (111) planes, which arc antiferromagnetically stacked 
in the (111) direction. In accordance with experiment an AFl groundstate with 
the localized / electron [39] was obtained. The equilibrium volumes of the three 
magnetic structures were found to be virtually identical. CeP was found to be se- 
mimetallic in the minimum energy position in both the AFl and AF2 structures, 
but at negative pressures (expanded volume), a semimetal-semiconductor tran- 
sition took place. This semimetallic behaviour was found to originate from small 
hole pockets around the F point in the center of the Brillouin zone and compen- 
sated by electron pockets around the M point on the Brillouin zone boundary. 
These pockets are mostly confined to the basal plane of the Brillouin zone. These 
results arc in agreement with do Haas- van Alphen [44,45] and photoemission [46] 
findings regarding the location of the electron and hole pockets. 

7 Conclusions 

We have described the SIC-LSD formalism that provides a mechanism for trea- 
ting the static Coulomb correlations within ah initio band theory. We have de- 
monstrated that it can treat both localized and delocalized electrons on equal 
footing, and owing to that is capable to study systems and properties for which 
LSD fails. What SIC-LSD does is to assign an energy contribution, the self- 
interaction correction, for an electron state to localize. Whether a system is 
localized or delocalized is then a result of a fight between this explicit ’localiza- 
tion energy’ and the band formation energy Both LSD and SIC-LSD arc local 
minima of the DFT energy functional. Like LSD, SIC-LSD is still a one-electron 
theory and both work only in extreme situations. Namely, when U /t <C 1, with 
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t being a typical hopping integral, LSD provides the correct physics, while for 
[//t 1, SIC-LSD gives the valid description of the physical situation. Most 

importantly, as discussed in the present paper, SIC-LSD provides a correct de- 
scription of pressure induced transitions from predominantly localized to pre- 
dominantly delocalized groundstates, and is in general much more appropriate 
approach for systems with strong Coulomb correlation. 
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Abstract. Ab initio formulations of the interlayer exchange coupling (lEC) between 
two, in general non-collinearly aligned magnetic slabs embedded in a non-magnetic 
spacer are reviewed whereby both the spacer and the magnetic slabs as well as their 
interfaces may be either ideal or random. These formulations are based on the spin- 
polarized surface Green function technique within the tight-binding linear muffin-tin 
orbital method, the Lloyd formulation of the lEC, and the coherent potential appro- 
ximation using the vertex-cancellation theorem. We also present an effective method 
for the study of the temperature dependence of the lEC. The periods, amplitudes, and 
phases are studied in terms of discrete Fourier transformations, the asymptotic be- 
havior of the lEC is briefly discussed within the stationary-phase method. Numerical 
results illustrating the theory are presented. 



1 Introduction 

Oscillatory interlayer exchange coupling (lEC) has been found in a number of 
ferromagnetic/non-magnetic multilayer systems and is in some cases accompa- 
nied by an oscillatory magnetoresistance. The physical origin of such oscillations 
is attributed to quantum interferences due to spin-dependent confinement of the 
electrons in the spacer. The periods of the oscillations with respect to the spacer 
thickness can be correlated to the spacer Fermi surface, a relation frequently 
used in experimental studies. A number of models have been proposed to ex- 
plain this phenomenon and we refer the reader to excellent recent reviews on the 
subject [1-3]. 

The situation is much less satisfactory if the amplitudes and/or phases are 
concerned. They both depend sensitively on the details of the Fermi surface, and, 
from the experimental point of view, on the quality of the multilayers. Typically, 
samples include various amounts of disorder at interfaces as well as in the bulk 
(c.g., surface roughness, intermixing, impurities, grain boundaries, etc.) which 
can influence the amplitudes and the phases significantly. From the theoretical 
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standpoint of view it is important to keep in mind that the lEC is an oscillatory 
phenomenon for which, strictly speaking, amplitudes and/or phases are defined 
only in the asymptotic limit. Experimental data, however, are usually only avai- 
lable for the first few oscillations which are sufficient to extract periods, but not 
amplitudes and phases, in particular for the so-called long-period oscillations. 
The presence of impurities not only complicates the theoretical studies but also 
can provide a valuable insight into the effects controlling the lEC. In particu- 
lar, substitutional alloying can provide a valuable informations concerning the 
topology of alloy Fermi surfaces. Alloying has also another, more subtle effect, 
namely it influences both amplitudes and phases and it can even introduce an 
extra damping of the oscillation amplitude (an exponential damping in addition 
to the usual 1/A^^ decay, where N is the spacer thickness) if k||-resolved electron 
states in the neighborhood of so-called callipers (extremal vectors of the Fermi 
surface) are influenced by disorder. Finally, we mention that a special case of 
alloying is intermixing of magnetic and spacer atoms at interfaces which can 
signihcantly influence coupling amplitudes and which occurs frequently during 
sample preparation in actual experiments. 

It is thus obvious that the study of the effect of alloying on the periods, ampli- 
tudes, and phases of the lEC is an important issue which, however, is not properly 
reflected in the available literature. Conventional bandstructure methods are of 
limited use for such studies although in particular cases, when combined with 
the virtual-crystal-type approximations (VGA), they may be justified, e.g., for 
VCr or CrMn alloy spacers studied recently [4]. However, the complete neglect 
of alloy disorder makes a reliable determination of the coupling amplitudes or 
phases and, to some extent, even of the coupling periods, uncertain even in such 
favorable cases. 

In addition, reliable conclusions and verifications of experimental measure- 
ments can only be based on a parameter-free theory. In order to determine the 
lEC one typically estimates the energy difference between the ferromagnetic (F) 
and antiferromagnetic (AF) alignment of a system consisting of two magnetic 
slabs separated by a non-magnetic spacer. Using total energy differences (eva- 
luated with the local density approximation to the density functional theory) 
represents an extremely difficult task as the tiny exchange energies have to be 
subtracted from the background of huge total energies. Even if one employs very 
fast and accurate linear methods and computational tricks, the spacer thickness 
for which the calculated lEC values are reliable, is limited to about 20 layers [5,6]. 
On the other hand, for thin spacers this is the most accurate approach. One can 
alternatively employ asymptotic theories which are, strictly speaking, valid in 
the opposite regime, namely, for large spacer and magnetic slab thicknesses. The 
idea is to determine reflection (transmission) coefficients for an isolated inter- 
face between magnetic and spacer metals and the extremal vectors of the spacer 
Fermi surface. The former quantities then determine the coupling amplitudes 
and phases while the latter quantities their periods. In this case the calculations 
can be performed by using conventional bandstructure methods and, in addi- 
tion, they will provide a deep insight into the physical nature of the lEC [7]. 
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Note, however, that neither of the above techniques can be extended to treat 
disorder nor can they be used to interpolate between two limits, namely, the case 
of thin spacers (preasymptotic region) and of thick spacers (asymptotic limit). 
For this a theory is needed which can bridge both the preasymptotic and the 
asymptotic region within a unified framework: lEC values for a large set of spa- 
cer thicknesses (say, for 1-100 atomic layers) can be analyzed in terms of discrete 
Fourier transformation in order to reliably determine not only periods, but also 
coupling amplitudes and phases. In addition, one can sample various subsets in 
order to analyze both the preasymptotic and the asymptotic regime as well as 
long-period oscillations. 

The basic idea is to determine the lEC directly by employing the so-called 
magnetic force theorem [8,9] for rotations in spin space rather then shifting atoms 
as in the conventional force theorem [10]. We can thus use the same potentials 
for both the F and AF (or, in general, rotated) alignments of the magnetic 
slabs (the frozen-potential approximation) and consider only the single-particle 
(Kohn-Sham) energies. 

This allows a direct formulation of interlayer exchange coupling based on an 
application of the Lloyd formula [11] in order to evaluate the difference between 
the grand canonical potentials of the F and AF alignment. The first calculations 
of that type were performed by Dederichs’s group in Jiilich [12]. The method 
used in the present paper extends the above approach in three relevant aspects: 
(i) a reformulation within the framework of a surface Green function technique 
by which linear scaling of the numerical effort with respect to the number of 
layers [13,14] is achieved; (ii) a proof of the so-called vertex-cancellation theo- 
rem [15] in order to study the influence of alloy disorder on the properties of the 
lEC, and (iii) an efficient method for a fast and accurate evaluation of integrals 
involving the Fermi-Dirac distribution function in order to study effects of finite 
temperature [16,17]. In the present paper we will review these particular tech- 
niques that were developed in the past few years and subsequently applied to a 
number of cases including alloy disorder [18-22]. In addition, we have studied 
systematically the effect of non-magnetic cap-layers [23,24] on the periods, the 
amplitudes, and the phases of the oscillations of the lEC. 



2 Formalism 

In this section we derive an expression for the lEC for in general non-collinearly 
aligned magnetic slabs embedded in a non-magnetic spacer. 

2.1 Geometry of the System 

The system considered consists of a stack of layers, namely, from the left to 
the right: (i) a semi-inhnito (nonmagnetic) substrate, (ii) a left ferromagnetic 
slab of thickness M (in monolayers, MLs), (iii) a nonmagnetic spacer of thickn- 
ess N, (iv) a right ferromagnetic slab of thickness M', and (v) a semi- infinite 
(nonmagnetic) substrate. The thickness of the ferromagnetic slabs may extend 
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to infinity. Eventually, one of the semi-infinite substrates may be substituted 
by a finite nonmagnetic cap of thickness P interfacing semi-inhnite vacuum. In 
general, the various parts of the system can consist of different metals, including 
disordered substitutional alloys. We assume that the spin orientation of the right 
magnetic slab is rotated by an angle 9 with respect to that of the left magnetic 
slab. In particular, the cases 0 = 0 and 0 = tt correspond to the ferromagne- 
tic and antiferromagnetic alignments of magnetic moments of two subsystems, 
respectively. 



2.2 Electronic Structure of the System 

The electronic structure of the multilayer is described by means of the tight- 
binding linear-muffin tin orbital (TB-LMTO) method [25]. In particular we 
employ the all-electron scalar-relativistic version as generalized to the case of 
random alloys, their surfaces and interfaces [26,27]. The key quantity of the for- 
malism, the physical Green function G{z), is expressed via the auxiliary Green 
function g°^{z) in the screened tight-binding LMTO representation a as 

G(^) = A«(^) + M“(^).9“(z)/i“(2), (1) 



where 






( 2 ) 



Here is a matrix of screened structure constants and P°^{z) is a 

site-diagonal matrix of potential functions ■ The potential functions are 

diagonal with respect to the angular momentum index L = {im,) and the spin 
index a =t,i while the structure constants are spin- independent. The potential 
functions can be expressed via the so-called potential parameters C, A, and 7 
in the following manner 



w A+{j-a){z-C)’ 

where for matters of simplicity all indices are dropped. Similarly, the quantities 
A“ and in (1) can be expressed as 



A“(z) 



7 — a 

A + (j - a)(z - C) ’ 






A + {j — a){z — C) 



( 4 ) 



As only the screened representation will be used the superscript a is omitted in 
the following. 

A separate problem is the determination of potential functions P{z) for a 
given layered structure. Here we only mention that by employing the magnetic 
force theorem wc can use the same potential functions for the ferromagnetic and 
rotated (or, antiferromagnetic) alignments. For random systems treated within 
the so-called coherent potential approximation (CPA) the potential function 
P{z) is substituted by its coherent potential counterpart, P{z), whereby the 
formal structure of the Green function (2) remains the unchanged. The methods 
of determination of (coherent) potential functions for collinear alignments of 
magnetic moments in the present context can be found elsewhere [27,26]. 
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2.3 Definition of the lEC 



The exchange coupling energy £x, evaluated in the framework of the magnetic 
force theorem, is defined as the difference of the grand canonical potential 
between the ferromagnetic (A = F) and antiferromagnetic (A = AF) alignments 
of two subsystems, i.e. = f^AF — ^f- More generally, the quantity of the 
physical interest is the difference of the grand canonical potentials between a 
rotated (9 ^ 0) and the ferromagnetic (9 = 0) alignment of the two magnetic 
slabs, namely, S-xi^) = = 12(0) — 12(0). 

The grand canonical potential 12 of a system is defined by 

/ OO 

/(E,T,/i)iV(E)dE, (5) 

-OO 



where N(E) is the integrated valence density of states, f{E,T,fj,) is the Fermi- 
Dirac distribution function at the temperature T and the chemical potential fj, 
of electrons. It should be noted that at zero temperature the chemical potential 
coincides with the Fermi energy Ep of the system. The integrated valence density 
of states is then given by 

N{E) = --lm[ Ti G{E' + lO) dE' , (6) 

^ , 1 — nn 



where Tr means the trace over lattice sites R, angular momentum indices L 
{im) and spin indices a. Using (3,4), the following identities can be verified 



^A(.) = -A»(.) , ^P(z)=PC). 



Together with formula (94), we find 

d 
dz 

The grandcanonical potential (5) is then expressed as 



Tr In \{z) + Tr In g{z) = -Tr G{z) . 



1 

I2(r, = f{E, T, g) Tr In \{E + fO) dE 

^ J — CO 

/ OO 

f{E,T,g) Trlng(F; + iO) dE . 

-OO 



1 

7T 



( 7 ) 



( 8 ) 



( 9 ) 



The formula in (9) is the expression for the grandcanonical potential within the 
TB-LMTO method [28] and for finite temperatures. 

The rotated magnetic configuration is characterized by the set of rotation 
angles O = {0r} for all the lattice sites. In the reference (F) state all the angles 
0R = 0 while in the rotated state 0 r = 0 in the rotated magnetic layer and 
0R = 0 for all other lattice sites. The quantities X{(-),z) and g{(-),z) for the 
rotated system are given by 

A(0, z) = U(0)A(O, z)U^ (0) , g{0, z) = [U(0)P(O, z)Ut (0) - S]~K (10) 




318 



Josef Kudrnovsky et al. 



Here [U((9)]rr' = <5rr'H(0r) is the rotation matrix for spin 1/2 particles defined 
in terms of the single-site matrices U (0r) [29] 




( 11 ) 



where c = cos(0/2), s = sin(6l/2), U(0)Ul(0)= Ut(0)U(0) = 1, and detU(0) = 
detljl(0) = 1. We note that in the rotated magnetic coiihguration P{0,z) = 
U(0)P(O, z)lJl(6)) is generally a non-diagonal matrix with respect to the spin 
indices <j, a' . 

The hrst term in (9) is independent of 6 because X{z) is site (and layer-) 
diagonal, it therefore does not contribute to the exchange energy £x{0), i-e., it 
is sufficient to consider the second part only, 

1 

(2{0,T,n) = Im / f{E,T,n)Trlxig(0,E + t0) dE . (12) 

^ J — CO 



It should be noted that the above expression is valid only in the absence of 
spin-orbit coupling. 

The magnetic force theorem used here for the evaluation of the lEC was used 
also in related problems, e.g., for the evaluation of the exchange energies of two 
impurities embedded in a nonmagnetic host [8] and then extended to the case 
of Heisenberg exchange parameters between two sites in a magnetic material [9]. 
In the latter case the magnetic force theorem is valid only for the infinitesimal 
rotations while in the former case it is valid also for 6 = tt [30] . 



2.4 Configurational Averaging 

Keeping in mind applications to random systems, one is interested in the conh- 
gurational average of the expression in (12), namely, 

(1?) = Im / f{E,T,jj,) {Ti\ng{E + i0)) dE , (13) 

where (...) denotes a configurational average. Difficulties here arise from the fact 
that the configurational average of the logarithm (In g{z)) can differ significantly 
from the logarithm of the configuration average In ((7(2)). The difference X = 
(In g) — In (g), the so-called vertex correction, is difficult to calculate and usually 
cannot be neglected. Fortunately, this problem can be circumvented by using 
the vertex cancellation theorem [15], which states that the contributions from 
the vertex correction for the F and AF configurations cancel each other exactly, 
namely Tr Xaf — Tr Xp = 0, such that to first order with respect to the angle 
between the magnetizations in the two ferromagnetic layers vertex corrections 
can be omitted. In other words, the evaluation of (13) simplifies to 

1 

(f?) = Im / f{E,T,g)Ti\Ti{g{E + i0)) dE , 

^ J —00 

= --Im f f{z,T,iJ,)Tichi(g{z)) dz. 

JC 



(14) 
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We have also substituted the energy integral by integration over a contour in 
the complex energy plane 0 . The possibility to neglect vertex corrections can 
conveniently be used in calculations of the interlayer exchange coupling as expli- 
cit numerical calculations have shown that it remains valid to a good accuracy 
even for an angle as large as tt [15]. In this respect it is very similar to the force 
theorem [10]. It is important to note that such an extension is only applicable 
to the evaluation of exchange energies of magnetic systems interacting via a 
non-magnetic host. An evaluation of exchange energies in ferromagnetic systems 
such as parameters of a classical Heisenberg model, was claimed to be limited 
to inhnitesimal rotations only [9]. The use of the vertex-cancellation theorem 
allows to reduce the computational time in Hrst-principles calculations by al- 
most two orders of magnitude, so that the computational effort for disordered 
systems is comparable to that for a pure system [15]. We refer the reader to 
Appendix A for more details concerning the derivation and applicability of the 
vertex-cancellation theorem. The last remark concerns the fact that the expres- 
sion for the change in the grandcanonical potential within the magnetic force 
theorem also includes the classical magnetostatic dipole-dipole interaction energy 
(DDIE). The DDIE decays with a spacer thickness much faster than the lEC and 
its contribution can be thus neglected for thicker spacer anyhow. In addition, 
first-principles fully-relativistic calculations of the lEC [32] have demonstrated 
that this term has a negligible influence even for a rather thin spacer amoun- 
ting just to a few layers. Consequently, the DDIE term will be neglected in the 
following. 

2.5 Lloyd Formula 

We need to evaluate the difference of configurationally averaged grandcanonical 
potentials in the rotated and FM conhgurations. This can be done conveniently 
with the help of the well-known Lloyd formula [11] applied to layered systems. 
We formally split the system into two non-interacting fragments, namely a left 
fragment £, which consists of the left substrate and the left magnetic slab, and 
a right fragment TZ, which comprises the rest of the system, i.e., the spacer, 
the right magnetic slab, and the right substrate (or, eventually, the cap layer 
interfacing the vacuum). Fragments are described by the unperturbed Green 
function {go{z)). In the next step we coTiple two fragments together with help of 
a localized potential V which is simply the interlayer screened structure constant. 
This procedure has a number of advantages as compared to a conventional way 
of embedding two finite magnetic layers into the inhnite (bulk) host spacer [12]: 
(i) the perturbation V is independent of the thicknesses of magnetic layers; (ii) 
complicated sample geometries can be treated, including semi-infinite magnetic 
layers; and (hi) a powerful and efficient method exists for the evaluation of the 
Green function of fragments, namely the surface Green function technique in the 
principal-layer formulation [26,27]. 
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Keeping in mind the vertex cancellation theorem, one gets for a difference in 
the configurationally averaged grandcanonical potential (14), the expression 

{6n) = --lml f{z,T, hi {1 -V {go{z))) dz, (15) 

TJ" Jc 

where {go{z)) is the configurationally averaged Green function of the decoupled 
non-interacting fragments C and TZ defined above. For the sake of simplicity, we 
will denote from here on the configurationally averaged quantities by an overbar, 
e.g., (go(^)) = go{z)- The concept of principal layers (PL) [33] as used within the 
TB-LMTO method leads to a block tridiagonal form of the structure constants 
and of the inverse Green function. If we apply this tridiagonality to (15), we get 
for V and (go{z)) the following expressions by using a superniatrix notation with 
respect to nearest-neighbor PLs resolved in the wave-vector k|j , 

Uoi(k||) 0 

where 5'io(k||) = [5'oi(k||)]^ Combining (15) and (16) one gets 

(5Trln5(2:) = ~ rc{h\\,z)Qn{^\\,z)] , (17) 

II k|| 

P/:(k||,2;) = S'io(k||)g£(k||,2;)5oi(k||). 

Here the quantity L£(k|| , z) has the meaning of an effective embedding potential, 
and the quantities Qc and Q-ji are the configurationally averaged surface Green 
functions (SGF) [33] of the magnetic subsystems £ and TZ, respectively. By 
definition, the surface Green function Qs {S = C, TZ) is the top PL projection of 
the Green function of the corresponding semi-infinite system S. Its determination 
in the case of random systems was extensively discussed in the literature, see 
[34-36,26]. The summation in (17) extends over the surface Brillouin zone (SBZ) 
corresponding to the underlying two-dimensional translational symmetry [37], 
and 1V|| is the number of sites in a layer. 



) ^ ^ g^(k|j,2)) ’ 



2.6 The lEC for a General Angle 6 

Let us now turn to the evaluation of the energy difference between arbitrary 
alignments. Consider the following quantity, 

tr In Z = tr In (1 — Ao B) — tr In (1 — Aq Bq) , (18) 



where the matrices Aq and Bq are related to the ferromagnetic alignment and 
thus are diagonal in spin space 



A(, 0 

0 Ai 



Bn 



0 

0 Bj, 



(19) 
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The particular form of the subblocks Aq and Bg (cr =t, |) is given by 

A^ = 5io(k||)02(k|i,2)5oi(k||), B^ = 0^(k||,z). (20) 

The matrix B refers to an alignment in which the orientations of the magne- 
tization in two magnetic slabs are rotated uniformly by a relative angle 9, 

B = U(0)BgUt(0), (21) 

where U(0) is the rotation matrix (11). The quantity 1 — Aq B in (18) can the- 
refore be written as 



1 — Ag B = 



U(0)-AoU(0)Bg)Ut(0), 



where, as follows from (19) and (11), 



(22) 



U(0) - Ao U(0) Bo = 

Using now the identity tr In X 
matrix X, and the identity 



c(l-AjBj) s(l-Aj 
-s (1 - A^ bJ) c (1 - A^ 

Indet X, which is valid for 



Bi) 

B;() 



(23) 



any non-singular 



det 




detA . detD . det(l - A^^ B C) , 



(24) 



which in turn is valid, if the matrices A and D arc non-singular, it is straight- 
forward to prove that 



trhi Z 



ti'L In 



1 — cos(0) 
2 




(25) 



where 



M = 1 - (1 - Aj Bj)-i (1 - Aj Bi) (1 - Ai Bi)-i (1 - A^ bJ) . (26) 



It should be noted that in (18) tr denotes the trace over angular moinenta and 
spin, while in (24) tr^ denotes the trace over orbital momenta only. The final 
expression for Sx{9) is thus given by 



Sx{9) 



TtA'iI 



^ Im / f{z,T,fi) 



ic 



tr^ln (i_L^^M(k||,^)) d^, 



(27) 



in which the energy integral is expressed in terms of a contour integral which 
will be discussed in detail later. 
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It is interesting to note that the expression (26) for M(k||,2;) can be rearran- 
ged in the following form [19] 

M = - (i - 5io Soi gi) Sio (gi - si) X 

(l - Soi gi Sw si) 5*01 (s^ - S^) . (28) 

It explicitly factorizes the ’spin-asymmetry’ of the problem and it is directly 
related to RKKY-like theories [1]. This result [19] is formally equivalent to the 
results of the spin current approach [39] as formulated within a Green function 
formalism based on an empirical single orbital tight-binding model [40]. A matrix 
version developed in the framework of a semiempirical tight-binding model has 
appeared recently [2]. 

For completeness we also give the result for the common case of the antifer- 
romagnetic alignment (0 = tt): 

£x = £x{t^) = Im / f{z,T,ii) tiL\nM(k\\,z)dz , (29) 

" k|| 

where tV( is a product of four terms, 

A4 = (1 - Aj Bj)-i (1 - Aj Bt) (1 - At B^)-^ (1 - A^ bJ) . (30) 



2.7 The Torque and Infinitesimal Rotations 

The differential change in the grand canonical potential S(2{0) with respect to 
a differential relative angle 6, —d 5fi{6) / d 6, is usually called the torque. The 
torque can easily be obtained by differentiating (27) with respect to the angle 6. 
By definition one gets therefore 

T(0) = -^^^ or 640) = - T{9')d0', (31) 

whereby T{6) follows immediately from (27) 



= / /(Ar,h) 



27t7Vi 



I k|| 

trL 



IC 



,z) ( 1 - - [1 - cos(0)] M(k|| , z) 



dz . (32) 



By formally expanding the logarithm in (27) in powers of 1 — cos(0), one can 
cast the expression for 6x((^) into the form 

£x(0) = [1 - cos{0)] + ^ 52 [1 - cos(0)]2 + . . . , 



(33) 
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where B\ and B 2 are the so-called bilinear and the (intrinsic) biquadratic exchange 
coupling coefficients, respectively, 



^ II k|| "'c 

^ II k|| 



(34) 



It may be, however, more convenient to fit the exact expression (27) into the 
form (33) by employing calculated values for 9 = Trj2 and 9 = tx [41]. We obtain 

jj ‘2 Sx{tx /2) Sx{tx) — 2Sx{tx /2) 

Bi - ^ , B 2 - ^ . (35) 

Of particular interest is the expansion of Ex{6) for a small 9, i.e., when 
1 — cos(0) is a small parameter (the method of infinitesimal rotations (MIR)). 
This approach becomes particularly relevant in the case when the spacer is a 
magnetic metal or for complicated geometries, e.g., for so-called periodic multi- 
layers. 



2.8 The lEC as Interface Interface Interaction 

We will now discuss briefly an alternative approach of a direct evaluation of the 
lEC as a difference in the interface-interface interaction energies rather then its 
indirect determination in terms of the energy of a single interface (13-16). We 
decouple the system into three fragments, a left, central, and right fragment, £, C, 
and 77., respectively. The left and the right fragment are formed by corresponding 
substrates with magnetic slabs whereby the central slab comprises the spacer. 
Both approaches are physically equivalent because it is irrelevant how the system 
is divided into an unperturbed part and a perturbation. Note, however, that 
the interface-interface formulation is more general as it could be used for a 
determination of interaction energies of two generally different interfaces. 

The derivation proceeds in two steps and employs partitioning technique with 
respect to the trace of the logarithm of the Green function. First, the subsystems 
£ and 77 are downfoldcd which leads to an effective problem of two localized 
perturbations in the subsystem C. The second step, a two-potential formula 
applied to the fragment C separates directly the interface-interface contribution. 
The result has formally the same structure as the previous one (17,26), but the 
subblocks Aq and Bg (cr =t, ^) are now of the following form 

AS = 5Afi(k||,2)rf(k||,2:)giAr(k||,2;) , BS=f^(k||,z). (36) 

The r-matrices Tj (i = 1, N) corresponding to ’’multiple scattering” at individual 
interfaces £/£, (7 = 1) and C/77, (7 = N) are expressed as 



= I/‘"(k||,z) [1 



5’«(k|i,^:)7”'"(k|j,2) 



-1 



(37) 
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where the effective embedding potentials (k|| , z) of the left and right interfaces 
{i = 1, A^), respectively, are defined as 

f f (k|| , 2 ) = 5io (k|| ) (ky , 2 ) 5oi (k|| ) , (38) 

a(k||,^) =5oi(k||)g^(k||,0) 5io(k||). 

Here, {S = £, TV) are the configurationally averaged SGFs of the left and the 
right semi-infinite regions, respectively. Details of the derivation can be found 
in Appendices B and C. The coupling between the two magnetic subsystems is 
due to the layer off-diagonal projections giAr(k||, 2 ;) and 5 ]vi(k||, 2 ;) of the Green 
function (GF) of the finite spacer consisting of N layers. The oscillatory behavior 
of interlayer coupling is then governed by the oscillatory behavior of these quasi 
one-dimensional spacer Green functions, a formTilation which is very much in 
the spirit of a simplified RKKY approach [1]. An efficient method of evaluation 
of the corner-blocks of the Green function, gij(k\^,z), {i,j = l,N), is described 
in Appendix D [42,36]. 

2.9 Relation to the KKR Method 

We shall discuss now the relation of the present approach (29,30,36) to the me- 
thod employed in [12] and based on the Korringa-Kohn-Rostoker (KKR) Green 
function technique. Let us note first the deep internal connection between the 
KKR and the TB-LMTO-GF approach (see [26,43] for more details). The model 
in (12) consists of an infinite ideal non-magnetic spacer as a reference system 
and of two magnetic slabs representing localized perturbations. For simplicity 
we start from the case of two magnetic monolayers in an infinite spacer. The 
result 



AS = S]Vi(^)(k||,^)i?(ki|,0)giV(k||,z), Bo" = t^(k||,z) (39) 

is formally the same with the exception that the r-matrices entering (36) are 
now substituted by the single-site t-matrices £ which describe the scattering of 
electrons from two magnetic monolayers at i = 1, N embedded in an infinite 
non-random bulk spacer and separated by A^ — 2 spacer layers: 

= APZ^,{z) [l + G\k^^,z)AP^.^{z)]-" . (40) 

The strength of the scattering potential, APy^[z), is given by the difference 
of the potential functions for the magnetic monolayer Py^[z) and for the non- 
magnetic spacer P{z), while g^(k||,z) is the layer diagonal block of the GF of 
the bulk spacer. The layer off-diagonal blocks of the bulk spacer GF, gj’^(^) and 
Gni{z), are given by 

gJ^(k|j,z) = [g*(k||,^)Aoi(k||)]'^“' e'(k||,z), (41) 

and similarly for G%i{z). Here, g®(k||, 2 ;) is the corresponding SGF of an ideal 
semi-infinite non-magnetic bulk spacer [33]. It should be noted that also the 
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layer-resolved bulk Green function can be expressed in terms of the 

SGFs (see, e.g., [34]). Since (41) is exact, there is no need to perform an addi- 
tional All -integration [12]. It is easy to show that the result is formally identical 
to the case of two impurities in a simple tight-binding linear chain model with 
nearest neighbor hopping. 

A generalization to the case of magnetic slabs containing a finite number 
M of magnetic layers is formally straightforward [12]. The t-matrices tyi{z) are 
then supermatriccs with respect to angular momentum and layer indiees and 
the numerical effort to evaluate (40) increases with the third power of M as 
contrasted with the results of the present approach (17,36) which depend only 
linearly on M. 

2.10 Influence of External Periodicity 

Until now it was assumed implicitly that we have a simple ’’parent” lattice [37]. 
The periods of the coupling oscillations are closely related to the Fermi surface 
geometry [1,2] of the bulk spacer. A different translational symmetry (complex 
lattices) or stacking sequence within layers will thus tend for sufficiently thick 
spacers to a different kind of bulk periodicity and hence to new periods. For 
example, an alternating stacking of fcc(001)-layers Cu and ordered c(2 x 2)- 
CuAu layers tends to an ordered fcc-CusAu alloy with a Fermi surface topology 
different from that of fcc-Cu spacer. For a discussion of ’’superlattice” formation 
in magnetic multilayers see also [38]. We will discuss in the following in some 
detail two possibilities, namely superstructures in the spacer and in the magnetic 
slabs. 

We start with the former case by assuming the same geometry as discussed 
in Sec. 2.1 but now the spacer slab consists of two non-magnetic metals A and 
B with respective thickness ha and Hb periodically alternating. Typically, the 
spacer layer starts with the layer A[B) and ends with the layer B{A), but the ter- 
mination of the spacer slab with the same layers is also possible (and interesting 
[22]). The particular case of riA = us = 1 corresponding to an (OOl)-stacking 
of an ordered fcc-CuAu alloy was already treated on a first-principles level [22]. 
The more geireral case, (n^,ns > 1), which corresponds to artificially grown 
superstructures, was treated only within a simple one-band model [44]. In both 
cases, new periodicities (in comparison with the spacers consisting from pure 
A or B metals) arise with an increasing number of repetitions. Alternatively, 
one can consider a superstructure within a given spacer layer, or combination of 
both, e.g., the above mentioned example of the ordered fcc-CusAu alloy spacer. 
The similar situation can be encountered also in the magnetic slabs. In particular 
the case of a c(2 x 2)-CoFc periodicity within the magnetic layers separated by a 
fcc-Cu(OOl) spacer [20] leads to the rather surprising appearance of new periods. 
These new periods can be now correlated to critical points of the spacer Fermi 
surface folded down to the Brillouin zone corresponding to a c(2x2)-superlattice 
[20]. A correlated gradual appearance of new periods and the order in statisti- 
cally disordered layers is a clear indication of their relation to a different bulk 
periodicity [20,22]. 
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A special case of alternating layers of A and B metals is when one of metals 
is magnetic and the other is nonmagnetic, all of which sandwiched between two 
substrates. This is the case of a periodic multilayer. 

The generalization of the present formalism to above discussed cases is rather 
straightforward. In the case of a superlattice within a layer it is just sufficient 
to substitute matrices appearing in (27,29) by the corresponding supermatrices, 
e.g., by (2 x 2)-supermatrices in the case of a c(2 x 2)-superlattice. The key 
quantity, the surface Green functions (20), can be easily evaluated also in 
this case (see for details [26]). The generalization of the formalism to the case 
of alternating layers from A and B metals is as well simple because the surface 
Green function is constructed in an epitaxial manner, i.e., layer by layer, and it 
is therefore immaterial if the stacking of layers consists of the same or a different 
material. In the limit of a periodic multilayer we should just keep in mind that a 
proper repeating unit consists now from four layers, namely S — M — S — M, where 
the symbols S and M refer to the spacer and magnetic layers, respectively. This 
is necessary to calculate the F and AF conhgurations needed for the evaluation 
of the lEG. We note that the present formalisin allows to evaluate efficiently 
and reliably the lEC for thick spacers (one hundred layers and more) which is 
important for realistic studies of so-called superlattice spacers and of periodic 
multilayers. 

2.11 Temperature-Dependence of the lEC 

We conclude this section by reviewing a recently developed technique for an 
efficient evaluation of the temperature dependence of the lEC [16]. The main 
cause for the temperature dependence of the lEC is connected with thermal 
excitations of electron-hole pairs across the Fermi level as described by the Fermi- 
Dirac function. It turns out that other mechanisms (as for example electron- 
phonon and electron-magnon interactions) are less important. We rewrite (29) 
into the following form 

54r) = Im/(T), I{T)= f f{z,T,f^)^{z)dz, (42) 

Jc 

where 

^ (43) 

II k|| 

with the energy integration performed over a contour C along the real axis and 
closed by a large semicircle in the upper half of the complex energy plane. 

The integral in (42) can be recast into a more suitable form using the analytic 
properties of 'I'iz), namely, (i) 'l'{z) is holomorphic in the upper half of the 
complex halfplane, and (ii) z]P{z) — s- 0 for 2 ^ 00 , Imz > 0. Let us define a new 
function <P{y) = —i'P{Ep + iy) of a real variable y, y >Q. Then at T = 0 K, 

r-\-oo 

7(0) = / ^{y)dy, (44) 

Jo 
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while at T > 0 K, 



/(r) = 2^fcsr^^(yfc), (45) 

fe=i 

where ks is the Boltzmann constant and the yk arc Matsubara energies, yk = 
-KkBT{2k — 1). In the limit T — > 0, I{T) — 7(0) continuously. 

We have verified that the function ^(y) can be represented accurately as a 
sum of a few complex exponentials of the form 

M 

'^(y) = exp(pjy) , (46) 

i=i 

where the Aj arc complex amplitudes and the pj arc complex wave numbers. 
An efficient method of finding the parameters Aj and pj is described elsewhere 
[16]. The evaluation of 7(T) is then straightforward: 

M ^ 

7(T) = -27TkBT g exp {nkBTpj) - exp {-nkBTp,) ’ 



which for T = 0 K gives 



^ A 

'(“) = - E ^ ■ <«) 

The efficiency of the present approach allows to perform calculations with a large 
number of k|| -points in the irreducible part of the surface Brillouin zone (ISBZ) 
in order to obtain accurate and reliable results. Note also that such calculations 
have to be done only once and then the evaluation of the lEC for any reasonable 
temperature is an easy task. 

The effect of finite temperatures on the lEC can be evaluated also analyti- 
cally. The analytical approach assumes the limit of large spacer thickness, for 
which all the oscillatory contributions to the energy integral cancel out with 
exception of those at the Fermi energy. The energy integral is then evalua- 
ted by a standard saddle-point method [1]. The general functional form of the 
temperature-dependence of the interlayer exchange coupling £x{T) in the limit 
of a single period is then given by 

cNT 

£x{T)=£x{Q)t{N,T), = (49) 

Here, N denotes the spacer thickness in monolayers, and c is a constant which 
depends on the spacer Fermi surface. The term £x{^) exhibits a standard N~“^- 
dependence [1], while the scaling factor t{N,T) depends on the product N and 
T. In the preasymptotic regime (small spacer thickness) the functional form of 
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t{N,T) differs from that of (49), particularly in the case of a complete, but rela- 
tively weak conhnement due to the rapid variation of the phase of the integrand 
which enters the expression for the lEC [45] . The present numerical technique is 
free of the above discussed limitations and can be used to check conclusions of 
model theories. 



3 Numerical Results and Discussion 

3.1 Details of Calculations 

Special care has to be devoted to the energy and the Brillouin zone integrations. 
For a finite temperature we determine the parameters of the complex exponenti- 
als in (46) through an evaluation of ^(y) at 40 Matsubara energies corresponding 
to r = 25 K. Wc have verified that the results depend weakly on the actual value 
of the parameter T. For T = 0 K we have tested two energy contours C, namely 
a semicircle between the bottom of the band (Emin) and Ep, or, alternatively, 
a line contour Ep + is, s £ (0,cx)), using a Gaussian quadrature. The results 
were very similar in both cases. Using a line contour avoids possible problems 
connected with the phase of a complex logarithm. Typically a total of 10-15 
energy points was used. A large number of k|| -points in the ISBZ is needed only 
for energy points close to the real axis, whereby generally a greater number is 
needed for lower temperatures and thicker spacers. The number of k||-points can 
signihcantly be reduced for energies well off the real axis. In particular, for the 
first energy point on the contour close to the Fermi energy we typically use 5000- 
10000 k||-points in the ISBZ, while for the next 3-4 energy points the number 
of k||-points is reduced by a factor two for each other point, and about 50-100 
k||-points are taken for all remaining energy points on the contour. The thickness 
of the spacer, for which well converged results are obtained, is about 100 spacer 
layers. 

3.2 Analysis of the Results 

The calculated results, namely £x{0, N), where N specifies the spacer thickness, 
can be analyzed in terms of a discrete Fourier transformation 

1 ^ max 

= - y] N‘^£x{ 0,N) eypp{iqN) , (50) 

^ N = Nrni„ 

where p = Nmax — Nmin -I- 1 is the number of values used in the Fourier ana- 
lysis, and Nrain is chosen in order to eliminate the effect of very thin spacers, 
or, to analyze intentionally either the preasymptotic or the asymptotic region. 
Typically p is about 40. The background oscillations thus occurring [14] are due 
to the discreteness of the Fourier transformation. The background oscillations 
can be smoothened using the procedure described in [46], namely by multip- 
lying N‘^£x(6 , N) by C s\n(pN /p) / (pN Jp), where C is a normalization factor. 
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The periods of oscillations Aa (in monolayers) are then identified with the po- 
sitions q„ of pronounced peaks of \F{qa)\ as A„ = 27r/ga, the amplitudes of 
oscillations are estimated from A„ = {2/p)\F[qa)\, and their phases from 
(t>a = 7>'/2 — AigF{qa), {a = 1,2,...). This analysis can be extended to more 
complicated cases, namely when the lEC is a function of two variables, e.g., 
as a function of the spacer and cap thicknesses N and P, respectively. A two- 
dimensional discrete Fourier transformation 

N2 P2 

F2{e,qN,qp)= ^ g (A + P)2£2(0,fV,P)e‘(«~'^+«"^) (51) 

N=Ni P=Pi 

is a suitable tool to analyze the quantity €2{0, N, P), where the prefactor (N -|- 
P)^ is consistent with the asymptotic behavior [23,47] for large spacer and cap 
thickness. Strictly speaking, this is quite an obvious choice for the case when 
the spacer and cap are formed by the same material, but it can be used also 
when the spacer and the cap correspond to different materials (for more details, 
see [23]). In (51) we have introduced the quantity 

S2{9,N,P)=S^ie,N,P)-£o{0,N), £o{e,N)= lim £,(ff,7V,P) (52) 

P->oo 

in order to remove a trivial peak in the absolute value of F2{9,qN,qp) at qN = 
qp = 0. A similar two-dimensional discrete Fourier transformation is also useful 
in the study of the lEC with respect to the thicknesses of the spacer and the 
magnetic slabs. We note that if one of variables, e.g., the spacer thickness N 
is hxed, it is possible to analyze the calculated lEC values again with the help 
of (50). 

An alternative of calculating the Fourier transform (50) consists in subdivi- 
ding the k||-integral in (27) into areas around the critical k||-vectors (callipers) 
related to the different oscillation periods [30,49]. In the asymptotic limit each 
subarea gives then rise to a single oscillation period, while in the preasymptotic 
regime the resulting division into different periods is only qualitatively valid. In 
a sense this method bridges the present method of discrete Fourier transforma- 
tions and the purely asymptotic treatment of calculating only the behavior of 
the critical k|| -vectors (see Section 3.3). 

3.3 Asymptotic Expansion 

Model studies [1,2] indicate that in the asymptotic region, i.e., for large spacer 
and magnetic layer thickness, and for a random sample, the general form of the 
spacer-thickness dependence of the lEC is given by 

£ 2 : = ^ ^ ex.p{iQaN) . (53) 

a 

Here the sum runs over all possible periods a, the quantities Za and are 
the complex amplitudes and complex stationary points (callipers), respectively. 




330 



Josef Kudrnovsky et al. 



defined in the following manner 

= Aa exp(f^a) , Qa = Qa + ■ (54) 

The quantities A„ and are the amplitudes and phases of coupling oscillations, 
Pa = 27t/(/o, their periods, and the quantity Aq characterizes the damping of 
oscillations due to the effect of alloying in the sample determined at the Fermi 
energy. In the limit of non-random samples, Aq, = 0. 

The parameters in (53) can be extracted from a detailed knowledge of the 
spacer Fermi surface [7]. We briefly sketch a numerical way of determining of 
the parameters of this asymptotic expansion which requires the knowledge of 
the integrand of (29) for a set of k|| -points in the neighborhood of the stationary 
points k||“\ 

The expression (29) for lEC at T = 0 K can be rewritten as 

^ Im V E(k||) , F(k||) = - f f{z,0) tiL lnM0i\\,z)dz . (55) 

fV|| ^ Jc 

The integration with respect to the energy variable is performed numerically. 
The function T(k||) for large N decreases as 0{1/N) and behaves like 

y(k||) = ^exp(rlV0(k||)), (56) 

where the pre-exponential factor (/(kn) is a smooth function of k|| and the phase 
<))(k||) has one, or more stationary points in the SBZ that correspond to callipers 
of the spacer Fermi surface such that Vkn ) = 0. The integral over the SBZ in 
(55) can be evaluated using the stationary-phase method. The contribution of a 
stationary point k|“^ = is found in the following way: the integration 

limits are extended to inhnity, and the phase function </>(k||) is approximated by 
a quadratic function of k|| = [kx, ky) in the vicinity of the stationary point, 

</)(k|i) = 0(k[“^) + ^ Q,j{k,~k!f^){kj -kf^) 

= Q^3k^kJ+ ( 57 ) 

i,j=x,y i—x,y 

The expansion coefficients Qij, Pi, and ^(k|“^) are determined by a least-square 

fit to values of <)'(k||) calculated in the vicinity of k|“\ This procedure allows 
to eliminate numerical inaccuracies with respect to both the values of Qij and 
the position of the stationary point k|“^ , and it is applicable even for disordered 
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surfaces. By inserting (56) and (57) into (55) we find 



1 



TriVVsBZ 






jj exp|^iAf^())(k|"^) + ^ Qij(fci - fcj“^)(fcj - fcj“^)^jdfca;dfcj; 






■g(kP) 

N“^Vsbz 1 i/- det IQI 



Im 



exp 



iAf^(k|“^) 



(58) 



where the two-dimensional integration region D extends to infinity, and Vsbz 
denotes the volume of the SBZ. The second line in (58) is obtained by diagona- 
lizing the quadratic form in the exponent (57) and by evaluating the resulting 
one-dimensional Gaussian-like integrals. The identification of the parameters is 
now straightforward, namely 



7T .9(k||"^) 

Vsbz det |Q| ’ 



Qa = <^(k[|“^). 



(59) 



3.4 Pree-Electron Limit 

The numerical efficiency of the present formalism offers an interesting possibility 
of testing model theories [1]. The simplest of such models is the free-electron 
model, because of a spherical Fermi surface with a single critical vector at k|| = 0 
and a trivial correspondence between the value of the oscillation period and 
the band-filling. The free-electron model can be easily simTilated by the present 
formalism by replacing the true metallic potentials by flat potentials (the empty- 
sphere model). For a suitable choice of the lattice constant and the position of 
the Fermi energy it is irrelevant what lattice and layer stacking is used, e.g., 
the fcc(001)-stack is the simplest choice. Such a model is free of the limitations 
usually adopted [1], e.g., the assumption of large spacer and magnetic slabs 
thicknesses, or the approximate evaluation of the energy integral for the case of 
finite temperatures. 



3.5 Numerical Illustrations 

In Fig. 1 N‘^Sx{N) is displayed as a function of the spacer thickness N for two 
semi-infinite Co(OOl) subsystems sandwiching an fcc-Cu spacer. The correspon- 
ding discrete Fourier transformation in Fig. 2 shows a pronounced short-period 
oscillations of 2.53 monolayers (MLs) while the long-period oscillations are sup- 
pressed in this geometry [13,14,49]. The results are insensitive to the choice of 
the lower and upper index in the summation in (50) provided the preasymptotic 
region is excluded [14]. 

For a large enough N the lEC can be approximated by the asymptotic form 
in (53). The amplitude, phase, and the wave- vector entering this expression can 
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N (monolayer) 



Fig. 1. Exchange coupling N^£x{N) at 'i' = 0 K as a function of the spacer thickness 
N for two semi-infinite fee Co (001) subsystems sandwiching a Cu spacer. Diamonds 
refer to the calculated values, the full line (back Fourier transform) serves as a guide 
to the eye 



be determined from the calculated £x{N) in the manner as described in Sect. 3.2 
and the asymptotic result (53) was compared with the calculated results for a 
large set of systems including both ideal and alloyed semi-infinite fcc(OOl) ma- 
gnetic subsystems sandwiching a Cu-spacer: overall good agreement was found 
[21]. An example of the complex amplitude for this case is presented in Fig. 3 
illustrating the insensitivity of the phase to elements which form the magnetic 
layers. It is seen that phases corresponding to Co, FesoNiso, and Fei/aNii/aCoi/a 
which have the same average electron numbers Ngi=9 are nearly the same [21]. 

The lEC depends on the temperature T via a factor x/sinh(x), x = cNT, 
where T is the temperature and N the spacer thickness. This remarkable result 
of model theories [1] was verified by calculations such as illustrated in Fig. 4. 
The lEC depends in an oscillatory manner not only on the spacer thickness N 
but as well on the thickness P of a covering cap. The oscillations are around a 
biased value which corresponds to coupling for a given spacer thickness assuming 
a semi-infinite cap. This phenomenon is illustrated in Fig. 5 in terms of discrete 
Fourier transformations with respect to the spacer and the 
cap thickness (see Sec. 3.2) for a sample consisting of a semi-infinite fcc-Cu(OOl) 
substrate, left and right magnetic layers each five MLs thick, a spacer with 
varying thickness N ^ and a Cu-cap of varying thickness P. Fig. 5 shows: (i) 
long-period oscillations (missing in Fig. 2) in addition to the short-period ones, 
and (ii) oscillations with respect to the cap thickness which arc exactly the same 
as for the spacer because both are controlled by the same Fermi surface, namely 
that of fcc-Cu. The more complicated case of different spacer and cap materials 
is discussed in [23,24]. 
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Real part of Z 



Fig. 2. Absolute value of the discrete 
Fourier transformation of N^Sx{N) for 
a finite set of spacer layers (A^=20-80) 
corresponding to two semi-infinite fee 
Co(OOl) subsystems sandwiching a Cu 
spacer. The temperature is T = 0 K 



Fig. 3. Complex amplitude = 

y^i/ 2 gt'S/ 2 ^ where A and <P are the os- 
cillation amplitude and phase, respec- 
tively, for a semi-infinite fcc(OOl) subsy- 
stems formed by Fe, Co, Ni, their binary 
alloys (bullets), and the ternary alloy 
Fei/sCoi/sNii/s (square) sandwiching a 
Cu spacer. The units are (mRy)^/^. The 
dotted, dashed, and full lines connect va- 
rious alloys and serve as a guide to the 
eye. The rays starting at the origin show 
approximately the phase corresponding 
to the indicated average number of va- 
lence electrons 



Ordering in the spacer [22] or in the niagnetie layers [20] can induce new 
periods due to the formation of two-dimensional sublattices. The situation is 
particularly interesting for a c(2 x 2)-ordering in magnetic layers sandwiching 
an ideal Cu-spacer [20]. As illustrated in Fig. 6 for full ordering two new periods 
with complementary periods and phases are formed in addition to a conventio- 
nal short-period due to a fcc-Cu spacer [20]. These new periods vanish in the 
completely disordered case. 






8,(N, T)/ 8,(N, 0) 
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Fig. 4. £x{N,T)/£x{N,T = 0) plotted 
as a function of ^ = NT for a trilayer 
consisting of semi-infinite fee Co(OOl)- 
slabs sandwiching a Cu spacer. The 
thick line refers to x/sinh(x), x = cNT 
with c = 0.000195 obtained by a least 
square-fit to the computed data 




Fig. 5. Absolute values of the discrete two-dimensional Fourier transformation of 
{N + P)'^ £ 2 {N, P) with respect to the spacer and the cap thickness in the case of 
two magnetic slabs each five monolayers thick with a Cu-substrate, a Cu-spacer, and 
a Cu-cap. For a definition of £2 (A”, P) see the text 



Finally, the effect of disorder in the spacer [19] is illustrated in Fig. 7. Alloying 
of Cu with Ni decreases the number of average valence electrons and leads to a 
contraction of the alloy Fermi surface, and in turn to a reduction of the coupling 
oscillations. The opposite behavior has to be expected for alloying of Cu with 
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Fig. 6. Absolute values of the discrete 
Fourier transformation of Ex(N) for 
two semi-infinite fee Co 5 oFe 5 o( 001 ) sub- 
systems sandwiching a Cu spacer with 
different kinds of chemical order in ma- 
gnetic layers: (a) S' = 1 (complete c(2 x 
2)-order, full line), (b) S = 0.8 (dashed 
line), (c) S — 0.5 (dashed-dotted line), 
and (d) S = 0.0 (disordered case, dotted 
line). The temperature is T = 0 K 



Zii, whereas only a Siiafi concentration dependence of the periods for the CuAu 
case is seen. The amplitudes of the oscillations are generally reduced by alloying, 
and in the case of CuZn spacer they are even 

exponentially damped. The different behavior of the amplitudes can be related 
to differently large disorder in the neighborhood of relevant extremal points of 
the alloy Fermi surfaces. 



3.6 Some Published Applications 

We briefly review applications of the formalism developed in previous sections 
to specific problems. Additional details concerning formalism and not discussed 
here in details, e.g., the expansion of the lEC expression in terms of the small 
parameter 1 — cos(0) or the details concerning the numerical verification of the 
vertex-cancellation theorem, can be found in [14,15], respectively. The influence 
of surface roughness (fluctuating spacer thickness and diffusion at the interface 
between spacer and magnetic layers) on the oscillation amplitudes was studied 
in [18]. The effect of alloying in the spacer [19] on the oscillation periods and their 
amplitudes, and in magnetic layers [21] on the oscillation amplitudes and phases 
was also studied in detail for the trilayer system Co/Cu/Co(001). Ordering in 
disordered spacers [22] and/or magnetic [20] layers lead to a formation of new 
periods not present in ideal spacers. Oscillations of the lEC can originate not 
only due to the spacer but also from adlayers or cap layers. We refer the reader 
interested in this problem to a recent detailed study [23,24]. Finally, the study 
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Fig. 7. Exchange coupling N^£x{N) at T = 0 K as a function of the spacer thickness 
N for two semi-infinite fee Co(OOl) subsystems sandwiching a spacer of (from bottom 
to top) Cu, Cu 75 Ni 25 (multiplied by a factor 5), CusoZnso, and CusoAuso- Diamonds 
refer to the calculated values, the full line (back Fourier transform) serves as a guide 
to the eye 



of the temperature dependence of the lEC and of the combined effect of the 
temperature and disorder is subject of very recent papers [16,17], respectively. 

4 Conclusions 

We have derived closed expressions for the exchange coupling between two ma- 
gnetic subsystems separated by a non-magnetic spacer with a relative angle 9 
between the orientations of the magnetizations in the magnetic slabs. The de- 
rivation is based on a surface Green function formalism. The numerical effort 
scales linearly with the thickness of both the spacer and the magnetic slabs. The 
formulation allows also for an efficient evaluation of the temperature depen- 
dence of the coupling amplitudes. Numerical examples were chosen to illustrate 
the theoretical aspects rather than to give a comprehensive overview of results 
obtained by the present formalism or by related methods. 
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We wish now briefly to mention some unsolved problems. The following list 
is neither complete nor are the problems listed according to their importance: 
(i) The oscillatory dependence of the lEC on the thickness of the magnetic slabs 
was not yet systematically investigated on an ab initio level. Existing calcula- 
tions [14,48,49] were performed for too thin magnetic slabs to relate occurring 
oscillations to extremal points of spin-polarized Fermi surfaces; (ii) The problem 
of biquadratic and higher order terms also did not receive a proper attention on 
an ab initio level. A relevant problem is a systematic study of situations for which 
the non-collinear (biquadratic) coupling can dominate. Obviously, it can happen 
most probably for the spacer thicknesses for which the lEC values are close to 
the transition between the F and AF couplings [41]. In addition, it remains to be 
seen whether a theoretical description of biquadratic coupling has to be based 
on a fully relativistic spin-polarized level; (iii) The study of superstructures in 
the spacer and/or in the magnetic slabs (see Sec. 2.10) offers a possibility of 
a deeper insight into the physical nature of the lEC because of new periods, 
which are connected with the extremal vectors of the spacer material in a more 
sophisticated manner than in the canonical cases of Cu or Cr spacers; (iv) The 
study of oscillatory behavior of exchange interaction across magnetic spacers is 
of great iirterest. One possibility here is to employ the method of infinitesimal 
rotations [9,14]; (v) The study of exchange coupling through the semiconducting 
or, more generally, through an insulating spacer where one expects exponential 
rather than A^“^-decay has remained limited until now to model studies [1]; (vi) 
The study of alloying in the spacer, magnetic layers and at interfaces has to be 
extended to new interesting systems. It offers a straightforward method to ob- 
tain valuable informations concerning alloy Fermi surfaces, in particular for the 
case of alloyed spacers; and, finally (vii) The study of the lEC through spacers 
with complex Fermi surfaces, in particular through the transition metal spacers. 
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5 Vertex Cancellation Theorem 

We present here a general discussion of exchange interactions in the presence of 
substitutional disorder. The results given here are used in the present paper to 
study interlayer exchange interactions, but they are also applicable for studying 
exchange interactions within a ferromagnet, exchange stiffnesses, spin-wave en- 
ergies, etc. The principal result is the “vertex cancellation theorem” of Bruno et 
al. [15]. In here we give an alternative, more general, derivation of this result. 
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Let u = {ufi} be a particular configuration of the local moments, where 
Ur is a unit vector pointing in the direction of the Jf2-th local moment. We are 
interested in the variation of the thermodynamic grandcanonical potential 

J7fl = --Im/ /(£:,T)Tr(ln5u(L; + iO+))dL; (60) 

^ 7—00 

with respect to u. The Green function ga{z) for a particular alloy configuration 
is dehned from the potential function Pu{z) corresponding to u as 

9u{z) = {Pu{z) - 5')“^ . (61) 

An immediate consequence of (61) is a trivial commutator relation to be used 
below, namely 



[-Pu(2);Su(2)]_ = [5’;5ru(z)]_ , (62) 

where [A; B]_ = AB — BA. The conhguration averaged Green function ((?u(^)) = 
g^{z) is usually formulated in terms of the coherent potential function Va{z) as 

5u(^) = i'Ptiiz) - 5”)^^ , (63) 

which leads to a relation analogous to (62), 

['Pu{z);gi^{z)]_ = [S;gi^{z)]_ . (64) 

In general, the averaging in (60) cannot be reduced to In 5 ^( 2 ;) and an evaluation 
of the so-called vertex corrections is necessary. We shall show, however, that the 
variation of (60) due to an inhnitesimal change of u takes a simple form. 

Let us consider the variation of the potential functions Pu(z) in more detail. 
To each lattice site R we associate a non-random vector 0 r = ^rUr, where 
Ur refers to the axis of rotation and 9r to rotation angle by which the reference 
orientation Uq.r is transformed into Ur. The transformed potential functions 
are therefore given by the following similarity transformation 

Pc,{z) = UePuo{^)U^\ (65) 

where the rotation matrix Uq in (65) is defined as 



rLs,R' U s' ^ ^RM' ^L,L’ X 





Ur ■ a 

- s,s' 



(66) 



The symbol cr in (66) denotes the vector of the standard 2x2 Pauli matrices 
and 1 is the 2x2 unit matrix. The first-order change of Pti.R{z) caused by an 
additional infinitesimal rotation 5 vr is then expressed as 



SPi,{z) = [SK;Pi,{z)]_ , 



(67) 
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where the matrix elements of the operator 5K = Usv — 1 are explicitly given by 

i^^)RLs,R'L's' = ^R.R' ^l,L' [a- ■ . ( 68 ) 

The introduced infinitesimal rotation vectors Jvjj satisfy U^v Uq = Uq+sb whe- 
reas, in general, Use Uq ^ Ue+se- Let us note that 5K is a non-random site- 
diagonal operator. 

The first-order variation of Tr (In 5 ( 1 ( 2 )) can be now formulated using (94, 
67) as 



^Tr(ln 5 u( 2 )) = -Tr( 5 u( 2 ) [5/C;F(i(2)]_) , (69) 

which can be rewritten by applying the permutation invariance of the trace and 
(62, 64) as 



5Tr(lii5u(2)) = - Tr{6K ([Pu(2);5u(2)]_)} 

= -Tr{(i/C([^;5u(^)]_)} 

= -Tv{5K [5;5a(2)]_} 

= -Tr{diC[Fu(2);5u(^)]_} ■ (70) 

By using the permutation invariance of the trace once again, (70) can be given 
the final form 



5TT{higt,{z)) = -TY{g^{z)[5K]Vis{z)]_] . (71) 



Let us note that (71) was derived in a formally exact alloy theory, but is valid in 
the CPA as well. Within the CPA, the result (71) has an obvious interpretation: 
the r.h.s. describes the variation of Tr In 5 (^( 2 ) induced by performing on the site- 
diagonal coherent potential functions Vxx.r{z) the same rotations ( 68 ) as applied 
to the potential functions Pu,r{z)\ note however, that this is not equal to the 
infinitesimal change of the true self-consistent CPA coherent potential function. 

Thus, the torque acting on the moment at site R due to the exchange inter- 
actions is given by 



Lla.R = — 



(5vh 



1 

7T 



r-\-oo 



f{E,T) X 



ImTr| 5 fl(£: + z0+)^^ [Rrct ; Vu{E + i0+)]_\ dE , (72) 



where Ur is a projector on site R. This exact result constitutes the “vertex 
cancellation theorem” for the torque. Its usefulness arises from the fact that the 
“vertex corrections” have been eliminated. 

In order to compute the difference of thermodynamic grandcanonical poten- 
tial between two local moment configurations Ui and U 2 in the CPA, we use a 
theorem due to Ducastelle [31], which states that the thermodynamic grandcano- 
nical potential, considered as a functional f2[P,P] of the independent variables 
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V and P, is stationary with respect to V when the latter satisfies the CPA 
self-consistency condition. This means that a first-order error in gives only a 
second-order error in Cfl. Let us approximate Vvl{z) by 

Vi,{z) « V'^{z) = Ue Vuoiz) , (73) 

i.e., we assume that Vvl{z) is transformed like Pu{z) under a rotation of the local 
moment direction. This can be expected to be a good approximation, provided 
the condition 

rriR < kp qn (74) 

is satisfied, where qu and vir are respectively the charge and spin moment at 
site R. We then get 

9ix{z) ~ gi{z) = {V'^{z) - Sy^ . (75) 

Replacing by and by in (71), we obtain 

5Ti- (ln 5 rn( 2 ;)) w 5Ti' Ingl ( 2 :) , (76) 

and integrating over the angles, we get 

1 /■+“ 

« - - / f{E,T)x 

J —00 

Im Tr [\ngy{E + i0+) - lngy{E + i0+)] dE , (77) 

which constitutes the “vertex cancellation theorem” for exchange energies. Note 
that we have derived here a form of the “vertex cancellation theorem” within the 
CPA since this is the scheme which is used in practical calculations; however, 
one can prove that the same result holds if one takes the exact solution to the 
configuration averaging problem. 

In the case of interlayer coupling, the condition (74) is satisfied even for large 
rotation angles, because d0R/dR differs from zero only in a region where uir 
is negligible. This was confirmed by explicit numerical calculations in [15]. 

6 The Interface— Interface Part of the Grandcanonical 
Potential 

In this Appendix we derive the basic relations for an evaluation of the lEC within 
the interface-interface interaction formulation. 

The subsystems C and TZ can be downfolded using the formula (88) 

Trln(P- S') = Trrln \P - s\ ATri^ln [P- S 

+ Trc In [(P - S)cc - {P - S)ccj^{P - S)cc 

- {P - S)cnj^{P - S)nc] ■ 



(78) 
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The first two terms are independent of the rotation angle 9 and, consequently, 
they do not contribute to the exchange energy £x{9). We are thus left with 
a quantity which is limited to the subspace C only. It is now easy to identify 
the individual terms in (78). The potential function blocks between different 
subspaces such as Pec or Pen are zero because the potential function P is site- 
diagonal. The blocks of S between neighboring subspaces do not vanish, but the 
non-zero subblocks connect only neighboring principal layers. The important 
part of the Tr In {P — S) is then reduced to 

Trc In {P - S)cc + Trc In [l - j^SioGcSoi - j^SoiGnSio] ■ (79) 

The first term is independent of 9 and thus does not contribute to the exchange 
energy. The second term can be simplified using the two-potential formula (93). 
We identify = C/{P — S), Vi = SioGc^oi, and V 2 = SoiGnSio- The t- 
matrices are then identical with the r-matrices, and the potentials Vi and V 2 are 
equal to the embedding potentials Pi and Tj. In this way we find the expression 
for the grandcanonical potential 



G(0,T,m) = Go(T,^) 



Illl 

7T 



f{E,T,i^i)Tii In 1 - giN{z)TN{z)gNi{z)Ti{z) 



dz , 



(80) 



where f2o(T,n) contains all the terms independent of 6 and the Tri applies only 
to the layer 1, i.e., the first spacer layer. If the system is invariant with respect 
to translations in the planes of atoms, or, if such a symmetry is restored by 
configuration averaging, (80) can be written as 



G(0,T,m) 



Go(T,p) 




fiE,T,fx) X 



trln [l 



5iAf (k|| , 2:)TAr (k|| , z)gAfi (k|| , 2 ;)ti (k|| , ^) 



dz , 



(81) 



where tr means the trace over angular momentum indices L = {£m) and the spin 
index a. 



7 Useful Mathematical Tools 

Theoretical developments and many calculations are facilitated by the partitio- 
ning technique and the two-potential formula applied to the Green function and 
its logarithm. 

Let P and Q denote projection operators onto the complementary subspaces 
(i.e. P + Q = 1). We denote the projections of matrices as PAP = App, PAQ = 
ApQ, etc., and P/A means the inversion of App in the subspace referring to 
projector P. In most applications, A = z — HoiA = P{z) — S and G{z) = A^^. 
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The projections of the inverse A ^ to the matrix A are given by [50] 



{A ^)pp 
{A-^)qq 

{A~^)pq 

{A^^)qp 



p 



App - Apq^Aqp 

Q 

AqQ - ^QP^^PQ ’ 

-^^PQ{^~^)qQ = ~{A~^)ppApQ^, 
~^Aqp{A^^)pp = -[A^^)qqAqp— . 



(82) 

(83) 

(84) 

(85) 



It is sometimes easier to invert the full matrix A than its blocks. In such a 
case the inverse partitioning is useful 



P 

App 



{A ^)pp 



P 

P - Apq{A~'^)qp 



P - {A '^)pqAqp 



{A ^)pp. 



(86) 



This can be used to calculate the surface Green function of a semi-inhnite system 
from the Green function of the inhnitc system. 

Partitioning technique also allows to simplify calculations involving Tr In of 
a matrix. The basic relation is 



TiTn = In det . (87) 

It then follows TrlnylB = Tr In + Tr In 5, Tr In 1 =0, Tr In {A^^) = — Trlnyl, 
and Trln [{A — B)^^\ = — TrlnTl — Trln [1 — A^^B]. The Tr In A can then be 
partitioned as 

Ti- In A = Ti-p In [PAP] + Ti-q In [QAQ - QA^AQ]. (88) 

To prove (88), let us multiply the matrix ^ by T = 1 — Aqp{P/A) from left 
and by i? = 1 — [P / A) Apq from right. The result is LAR = App + Aqq — 
Aqp{P / A)ApQ. Now using (87), and the fact that det [L] = det [i?j = 1 we find 
(88). In a special, but important case, when App = P and Aqq = Q it holds 

Tr In = Ttp+q In [P + Q + Apq + Aqp] 

= Trp In [P - Apq Aqp] = Trg In [Q - Aqp Apq] . (89) 

The Green function of a system described by the Hamiltonian H = Hq + vi + 
V 2 , where Hq is the unperturbed part, and Vi{i = 1, 2) are perturbing potentials, 
is given by G = G^°^ +G^°^TG^°\ where G = {z — H)~^ , G^°^ = {z — Ha)~^ , and 
T = H(l-G(0)y)-i, where H = v\-\-V 2 - The full T-matrix T can be expressed in 
terms of the t-matrices, ti = Vi{\ — , {i = 1, 2) and of the unperturbed 

resolvent G*-°^ by the the two-potential formula 

T = till- G^°k.2G^°k.i]-^ (1 + G^°^t2) + [1 - G(°)fiG(°)t2]‘^ X 

(l-tG(°)fi). (90) 
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It is derived in the following way. Because 

(1 - ^)[1 - (1 - A)-^AB{1 - -B) = l-A-B, (91) 



it holds 



Tr In [1- A- B]= Tr In [1 - A] + Tr In [1 - B] 
+Tr\n[l-{1-A)-^AB{1-B)-^]. (92) 

By inserting A = and B = G^^'^V 2 into (92) one obtains (90). The two- 

potential formula for the Tr In of the full Green function 

TrlnG = TrlnG(°)[l - 

= TrlnG(°) - Trln[l - G(°)ui - G(°)u2] 

= Tr In G^°) - Tr In [1 - G(°)ui] - Tr In [1 - G(°)u 2 ] 

-TiTn[l - G(°)<iG(°)t 2 ] (93) 



follows directly from (92). 

If the matrix ^4 is a function of a variable z (complex in the general case), 
the derivative with respect to x is given by 



— Trln [A{z)] = Tr 
dz 



r d 



-A(.)W(.)J 



(94) 



provided that the matrix A{z) is nonsingrdar. This identity is used to derive the 
expression of the grandcanonical potential 1? in terms of the auxiliary Green 
function (12) within the TB-LMTO. 

The identity in (87) is valid up to an integer multiple of 27ri. Neglecting 
this fact can lead to serious errors. There is no panacea for this kind of diffi- 
culties, but in some situations they can be avoided, for example by choosing 
the integration contour parallel to the imaginary axis, but this is not always 
possible. In some cases the incremental procedure for calculating the In det, 
hi/( 2 :fc+i) = 111 f{zk) + In [f {zk+i) / f {zk)] in the spirit of an analytical continua- 
tion can be helpful, provided that the change of phase between two consecutive 
points Zk is less than 27 t. To insure this, one has to choose a sufficiently small 
grid in 2 . 



8 Inversion of Block- Tridiagonal Matrices 

We wish to compute g = A^^ for a block-tridiagonal A. The matrix A is divided 
into N X N square subblocks of the same dimension m, from which non-zero are 
only Ak^k, Ak-i^k, and Ak,k-i- The diagonal blocks are a sum of two terms: her- 
mitean matrix and a symmetric complex matrix. They are always non-singular. 
The off-diagonal blocks under the diagonal are equal to hermitean conjugate of 
the corresponding blocks above the diagonal {Ak,k-i = k)- methods 

based on repeated use of partitioning are particularly efficient if only diagonal 
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blocks, or four so-called ’corner’ blocks {gi,i,gN,N ,gi,N igN,!) are needed like in 
the interlayer exchange coupling calculations. 

First, four sequences of auxiliary matrices are calculated 



XN~k 


= Aff^k,N- 


-fc-|-l(7lAr-fc-|-l,Ar-fc-|-l - 


-Xn- 


-fc-K, 


) ^An- 


-k-\-\,N — k 




Xn = 


0, (fc = l,... 


N -1) 


? 








Yu+i 


= Ak+i,k{Ak,k - Yk) ^Ak 


,k+l , 


Ti = 


0, 


(fc = 2, 


...N) 


Zk 




- Xk) ^Ak,k-ij 


[k = 


2,.. 


.N) 






Wk 




- Yk) 


{k = 


1,... 


N - 


1), 





that are used to compute the diagonal and off-diagonal blocks of g 

9k, k — {^k,k ^k Ffc) ; 

9ig — gi—lyj ^ ^ J i 

9i,j =^i9t+i,] for i<j. (96) 

It can be proved that the numerical effort to evaluate the corner blocks scales 

as 0{Nm^). The details, particularly the tests of efficiency can be found in [42]. 

References 

1. P. Bruno, Phys. Rev. B 52, 411 (1995). 

2. J. Mathon, M. Villeret, A. Umerski, R.B. Muniz, J. d’Albuquerque e Castro, and 
D.M. Edwards, Phys. Rev. B 56, 11797 (1997). 

3. See theoretical and experimental review articles in IBM J. Res. Development 42, 
No. 1 (1998). 

4. M. van Schilfgaarde, F. Herman, S.S.P. Parkin, and J. Kudrnovsky, Phys. Rev. 
Lett. 74 , 4063 (1995). 

5. M. van Schilfgaarde and F. Herman, Phys. Rev. Lett. 71 , 1923 (1993). 

6. S. Mirbt, H.L. Skriver, M. Alden, and B. Johansson, Solid State Commun. 88, 331 
(1993). 

7. M.D. Stiles, Phys. Rev. B 48, 7238 (1993). 

8. A. Oswald, R. Zeller, P.J. Braspenning, and P.H. Dederichs, J. Phys. F: Met. Phys. 
15 , 193 (1985). 

9. A.I. Liechtenstein, M.I. Katsnelson, V.P. Antropov, and V.A. Gubanov, 
J. Magn. Magn. Mater. 67 , 65 (1987). 

10. A.R. Mackintosh and O.K. Andersen, in Electrons at the Fermi Surface, Ch. 5.3., 
ed. M. Springford (Cambridge University Press, Cambridge, England, 1980). 

11. P. Lloyd and P.V. Smith, Adv. Phys. 21 , 69 (1972). 

12. P. Lang, L. Nordstrom, R. Zeller, and P.H. Dederichs, Phys. Rev. Lett. 71 , 1927 
(1993). 

13. J. Kudrnovsky, V. Drchal, I. Turek, and P. Weinberger, Phys. Rev. B 50 , 16105 
(1994). 

14. V. Drchal, J. Kudrnovsky, I. Turek, and P. Weinberger, Phys. Rev. B 53 , 15036 
(1996). 

15. P. Bruno, J. Kudrnovsky, V. Drchal, and I. Turek, Phys. Rev. Lett. 76, 4254 
(1996). 




Interlayer Coupling 345 



16. V. Drchal, J. Kudrnovsky, P. Bruno, and P. Weinberger, Phil. Mag. B 78, 571 
(1998). 

17. V. Drchal, J. Kudrnovsky, P. Bruno, P.H. Dederichs, and P. Weinberger, (to be 
submitted). 

18. J. Kudrnovsky, V. Drchal, I. Turek, M. Sob, and P. Weinberger, Phys. Rev. B 53, 
5125 (1996). 

19. J. Kudrnovsky, V. Drchal, P. Bruno, I. Turek, and P. Weinberger, Phys. Rev. B 
54, R3738 (1996). 

20. J. Kudrnovsky, V. Drchal, C. Blass, I. Turek, and P. Weinberger, Phys. Rev. Lett. 
76, 3834 (1996). 

21. J. Kudrnovsky, V. Drchal, R. Coehoorn, M. Sob, and P. Weinberger, Phys. Rev. 
Lett. 78, 358 (1997). 

22. P. Bruno, J. Kudrnovsky, V. Drchal, and I. Turek, J. Magn. Magn. Mater. 165, 
128 (1997). 

23. J. Kudrnovsky, V. Drchal, P. Bruno, I. Turek, and P. Weinberger, Phys. Rev. B 
56, 8919 (1997). 

24. J. Kudrnovsky, V. Drchal, P. Bruno, R. Coehoorn, J.J. de Vries, K. Wildberger, 
P.H. Dederichs, and P. Weinberger, MRS Symposium Proceedings, eds. J. Tolbin 
et ah, Vol. 475, 575 (1997). 

25. O.K. Andersen and O. Jepsen, Phys. Rev. Lett. 53, 2571 (1984). 

26. I. Turek, V. Drchal, J. Kudrnovsky, M. Sob, and P. Weinberger, Electronic 
Structure of Disordered Alloys, Surfaces and Interfaces (Kluwer, Boston-London- 
Dordrecht, 1997) 

27. See I. Turek, J. Kudrnovsky, and V. Drchal, these Proceedings. 

28. V. Drchal, J. Kudrnovsky, L. Udvardi, P. Weinberger, and A. Pasturel, Phys. Rev. 
B 45, 14328 (1992). 

29. See, for example, A. Messiah, Quantum Mechanics, vol. II, (North-Holland, Am- 
sterdam, 1969), Appendix C. 

30. P. Lang, L. Nordstrom, K. Wildberger, R. Zeller, and P.H. Dederichs, Phys. Rev. 
B 53, 9092 (1996). 

31. F. Ducastelle, J. Phys. C: Solid State Phys. 8, 3297 (1975). 

32. L. Szunyogh, B. Ujfalussy, P. Weinberger, and C. Sommers, Phys. Rev. B 54, 6430 
(1996). 

33. B. Wenzien, J. Kudrnovsky, V. Drchal, and M. Sob, J. Phys.: Condens. Matter 1, 
9893 (1989). 

34. J. Kudrnovsky, I. Turek, V. Drchal, P. Weinberger, N.E. Christensen, and 
S.K. Bose, Phys. Rev. B 46, 4222 (1992). 

35. J. Kudrnovsky, I. Turek, V. Drchal, P. Weinberger, S.K. Bose, and A. Pasturel, 
Phys. Rev. B 47, 16525 (1993). 

36. V. Drchal, J. Kudrnovsky, and I. Turek, Comp. Phys. Commun. 97, 111 (1996). 

37. P. Weinberger, Phil. Mag. B 75, 509 (1997). 

38. J. Zabloudil, C. Uiberacker, U. Pustogowa, B. Blaas, L. Szunyogh, C. Sommers, 
and P. Weinberger, Phys. Rev. B 57, 7804 (1998). 

39. J.C. Slonczewski, Phys. Rev. B 39, 6995 (1989). 

40. D.M. Edwards, A.M. Robinson, and J. Mathon, J. Mag. Mag. Mat. 140-144, 517 
(1995). 

41. C. Blaas, P. Weinberger, L. Szunyogh, J. Kudrnovsky, V. Drchal, P.M. Levy, and 
C. Sommers (submitted to J. Phys. I France). 

42. E.M. Godfrin, J. Phys.: Condens. Matter 3, 7843 (1991). 

43. P. Weinberger, I. Turek, and L. Szunyogh, Int. J. Quant. Chem. 63, 165 (1997). 




346 



Josef Kudrnovsky et al. 



44. M.S. Ferreira, J. Phys. Condens. Matter 9, 6665 (1997). 

45. J. d’Albuquerque e Castro, J. Mathon, M. Villeret, and A. Umerski, Phys. Rev. B 
53, R13306 (1996). 

46. C. Lanczos, Applied Analysis, (Dover, New York, 1988), p. 219. 

47. P. Bruno, J. Magn. Magn. Mater. 164, 27 (1996). 

48. S. Krompiewski, F. Siiss, and U. Krey, Europhys. Lett. 26, 303 (1994). 

49. L. Nordstrom, P. Lang, R. Zeller, and P.H. Dederichs, Phys. Rev. B 50, 13058 
(1994). 

50. P.O. Lowdin, J. Chem. Phys. 19, 1396 (1951). 




Disordered Alloys and Their Surfaces: 
The Coherent Potential Approximation 



I. Turek^, J. Kudrnovsky^, and V. Drchal^ 

^ Institute of Physics of Materials, Academy of Sciences of the Czech Republic, 
Zizkova 22, CZ-616 62 Brno, Czech Republic 
^ Institute of Physics, Academy of Sciences of the Czech Republic, 

Na Slovance 2, CZ-182 21 Praha 8, Czech Republic 



Abstract. A recently developed ab initio approach to the electronic structure of sub- 
stitutionally disordered alloys and their surfaces is reviewed. It is based on (i) the 
tight-binding linear muffin-tin orbital (TB-LMTO) method in the atomic sphere appro- 
ximation which provides a physically transparent solution of the one-electron problem 
in metallic materials, (ii) the coherent potential approximation (CPA) for a mean- 
field treatment of the substitutional randomness, and (iii) the surface Green functions 
for a proper description of the true semi-infinite geometry of surfaces and interfa- 
ces. Theoretical formulation of fundamental electronic quantities, both site-diagonal 
(charge densities, densities of states) and site non-diagonal (the Bloch spectral fun- 
ctions) is presented. Transformation properties of the LMTO-CPA theory as well as 
specific problems of application of the local density approximation to random alloys 
are briefly discussed and basic algorithms employed in the numerical implementation 
of the formalism are described. 



1 Introduction 

Recent ab initio investigations of electronic properties of solids rely on the local 
spin-density approximation (LSDA) to the density-functional formalism and on a 
number of techniques solving the corresponding one-electron Schrodinger (Kohn- 
Sham) eigenvalue problem. These techniques comprise, e.g., the Korringa-Kohn- 
Rostoker (KKR) method [1,2], the linear muffin-tin orbital (LMTO) method 

[3.4] , the linear augmented plane-wave (LAPW) method [3,5], or the optimized 
linear combination of atomic orbitals (LCAO) method [6]. They provide a rea- 
sonable description of the electronic structure for most of metallic solids even 
within the mufhn-tin model [1,2] or the atomic sphere approximation (ASA) 

[3.4] . Full-potential versions of these techniques yield in principle an exact solu- 
tion to the Schrodinger equation which is indispensable for accurate evaluation 
of total energies, forces, and other important quantities for perfect bulk solids 
(elemental metals, ordered alloys) as well as their defects (impurities, surfaces, 
grain boundaries). 

Substitutionally disordered alloys (substitutional solid solutions) represent 
a broad class of systems where the above mentioned methods are only parti- 
ally successful: their direct application requires large supercells simulating the 
randomness of real alloys. The coherent potential approximation (CPA) — in- 
troduced three decades ago [7] in terms of the Green functions - offered an 
H. Dreysse (Ed.): Workshop 1998, ENP535, pp. 349-378, 1999. 
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effective-medium (mean-field) approach to the electronic structure of random 
alloys. Further development of the CPA was formulated first in a tight-binding 
picture (TB-CPA) [8,9] followed by the KKR-CPA theory [10,11] (for a review 
of both approaches see, e.g., [2]). In the early 1980’s, the KKR-CPA became a 
theory of random alloys fully comparable to the existing charge selfconsistent 
techniques for non-random systems. The development of the TB-CPA continued 
towards an ab initio level which was motivated by a need for a physically simple 
description of the electronic structure of bulk alloys and their surfaces. This ef- 
fort led to the LCAO-CPA [12,13] and the LMTO-CPA [14,15] methods which 
can be considered as alternatives to the KKR-CPA. 

In this contribution a brief theoretical background of the LMTO-CPA me- 
thod is given together with numerical techniques used in practice. The theory 
is developed within the TB-LMTO-ASA method [16,17] which results in an ef- 
ficient unified scheme for the electronic structure of random and ordered bulk 
alloys, their surfaces and interfaces. Its full detailed description was presented in 
[15] while the numerical algorithms were reviewed in [18]. The paper is organized 
as follows. Section 2 summarizes the most important relations of the TB-LMTO- 
ASA method in terms of the Green functions. Section 3 is the central part of the 
paper: it introduces the concept of configurational averaging and describes the 
theoretical and numerical aspects of the LMTO-CPA method. Section 4 presents 
a short review of quantities and techniques for a treatment of layered systems 
(surfaces and interfaces). Section 5 deals with the application of the LSD A to 
random alloys. Finally, a brief survey of existing results and further extensions 
of the method is given in Sect. 6. 

It should be noted that the LMTO-CPA formalism bears strong similarities 
with the KKR-CPA theory within the ASA, so that expressions for many quan- 
tities (e.g., densities of states, electronic charge densities) are fully analogous in 
both approaches. However, there are differences as well which arise from different 
Hilbert spaces and Hamiltonians: the KKR theory is based on the Hamiltonian 
H — —A + V (r) acting in the Hilbert space of functions V'(t’), where r is a 
three-dimensional continuous variable, whereas the LMTO theory uses a local 
basis set with a finite number of orbitals per lattice site. The Hamiltonian is 
then a matrix quantity. Despite the fact that the spectra of both Hamiltonians 
are in principle identical (in a limited energy interval), some quantities (e.g., the 
Bloch spectral functions) become non-equivalent in the two approaches. From 
the point of view of the alloy theory, both formulations have their own merits: 
the Hilbert space of the KKR-CPA is explicitly non-random (independent of a 
particular alloy configuration), the TB (matrix) formulation of the LMTO-CPA 
offers, e.g., a simple perturbative treatment of relativistic effects (spin-orbit cou- 
pling) or an inclusion of many-body effects in terms of the iiitraatomic Coulomb 
and exchange integrals, etc. 
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2 Green Functions in the Atomic Sphere Approximation 

The solution of the one-electron Schrodinger equation with a potential V (r) can 
be equivalently formulated in terms of the one-electron Green function G(r, r' \ z) 
defined (with spin variables omitted) by [1,2] 

[z + Ar — G(r)] G{r,r']z) = 6{r — r') , 

[z + Ar> - V{r')] G{r,r';z) = S{r - r') , (1) 

where z denotes a complex energy. The Green function G{r, r'; z) is an analytic 
function of z with the exception of poles and/or branch cuts on the real energy 
axis. Within the ASA, the Green function for a closely packed solid can be 
written in the form [15,19,20] 

G{r R,r' R'-z) = - SuRf ^ ^RL{r^.z) ‘fRLir^jz) 

L 

+ ^ ^RL{r,z) Grl,r'L'{z) ^R>L'{r\z) . ( 2 ) 

LL‘ 

Here R, R' denote the lattice points (centers of the atomic spheres), L, V are the 
angular momentum indices (L = (£,m)), the variables r,r' refer to positions of 
points inside the individual atomic spheres, and the symbol (r^) denotes that 
of the vectors r,r' with the smaller (larger) modulus. The functions ipRL{f,z) 
and are defined by 

'^RL{r,z) = (pRx{r,z)YL{r) , ipRL{r, z) = ipm{r, z)YL{r) , (3) 

where r = jrj, f = r/r, and YL{r) denotes the real spherical harmonics. The 
radial amplitudes ipR£{r,z) and (fiRi{r,z) are respectively regular and irregular 
solutions of the radial Schrodinger equation for the J?-th atomic sphere of radius 
Sr and for the complex energy z. The regular solution is normalized to unity, 

Sr 

‘PRe(r,z) dr = 1 , (4) 

while the irregular solution is unambiguously specified by a smooth matching at 
the sphere boundary (r — sr) to the energy derivative of the regular solution 
‘PRiir, z) (an overdot means energy derivative). 

The Green function matrix G rl,r' l'{z) in (2) will be referred to as the 
physical Green function. It is given in terms of the potential functions Pr^{z) 
and the canonical structure constants by 

Grl,r'L'{z) = ^Re(z) dRL^R'L' + t^m{z) 9 rl,r'L'{^) > ( 5 ) 

where the quantities on the r.h.s. are defined as 

9rlm'L'{^) 



— \J > 

= {[P\z)-S^G] 



( 6 ) 
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In the last equation, the symbol P^(z) stands for a diagonal matrix of potential 
functions, Prl,r'L'(^) ~ Srl,r'L'- The matrix 9%_l,r'L'{^) will be refer- 

red to as the auxiliary (or KKR-ASA) Green function. The superscript 0 of all 
quantities in (5, 6) denotes the canonical LMTO representation. The physical 
and auxiliary Green functions are connected by a trivial relation (5), the former 
one is directly related to the Green function in real space (2), the latter one is 
of a simpler form and thus bettor suited for numerical applications. 

Let us now summarize the most important relations involved in the TB- 
LMTO theory [16,17]. The superscript a marks the corresponding representation 
specified by the screening constants aRi (the trivial choice uri — 0 corresponds 
to the canonical representation). The transformations of the screened potential 
functions Pr(^{z) and the screened structure constants from a particular 

representation a to some other representation (3 (specified by a different set of 
the screening constants Pri) are given by 

= PU^) [ 1 + - pRi) P^e(z) ]-' , 

= {"'° II + (“-««" ■ P) 

These relations serve simultaneously as definitions of the screened quantities 
from the canonical ones (P®(z), The second equation is written in a matrix 
notation with a, f3 being diagonal matrices of the form aR^ Rr^r = ocri^rl^r' u ■ 
In analogy to (6), we define 

^‘Reiz) = \j Pm^z) , A^(z) = - - | , 

^ Pmiz) 

( 8 ) 

As a consequence, one can prove two important relations, namely 

Grl,r'L'{z) = XRg{z) Srl,r'l' + f^miz) 9 rl,r'L'{z) , (9) 



and 



9rl,r'L'{z) 



(o ) Pm(z) j 

[PRe - OlRi) — fi , , 0RL,R'L' 

P'miz) 



Rt\ 



Prp\z) 



9rl,r'L'{z) 



Pru'^z) 



PR'P'iz) 



( 10 ) 



The first relation (9) implies that the physical Green function is invariant with 
respect to the choice of the screening constants aRj, cf. (5). The second relation 
(10) means that the auxiliary Green functions in different representations are 
related to each other by a simple rescaling. Let us note that the first term on the 
r.h.s. of (9) does not contribute to calculated physical quantities in most cases. 
However, its presence is inevitable for correct analytic properties of the Green 
functions (2, 9) in the complex energy plane. 
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The only non-trivial step in a calculation of the Green function for a solid is 
the matrix inversion (8) defining the auxiliary Green function (z). In the 

case of a bulk solid with three-dimensional translational symmetry, the lattice 
points R can be expressed in the form R = B + T where B runs over a finite 
number of the basis vectors while T runs over the translation lattice vectors. 
The lattice Fourier transformation of the structure constant matrix leads to a 
fe-dependent matrix quantity 

^BL,B'L'i^) = ^BL,{B'+T)L' Cxp(i fc • T) , (11) 

T 

where k denotes a veetor from the first Brillouin zone (BZ). The lattice Fourier 
transform of the auxiliary Green function, {k, z), is given by the inverse 

of a finite-dimensional matrix: 

ffBL^B>L'(k,z) = , ( 12 ) 

where P°^{z) denotes a diagonal matrix of the potential functions of the inequiva- 
lent atoms, ^BL,B’L’i^) — ^Bti^) ^BL,B'L'- The inverse Fourier transformation 
(a BZ-iiitegration) yields then all elements of the auxiliary Green function as 

9BL,{B'+T)L'i^) ~ 9 bl,b'L'{^^ z) exp(— ife-T) , (13) 

k 

where N is the number of cells in a large, but finite crystal with periodic bo- 
undary conditions. 

Further, let us mention the link between the Green functions and the stan- 
dard LMTO theory. By using parametrized forms of the potential funetions 
P^g{z) and the related quantities A^(z) and lP^[z) which are correct up to the 
second order in a limited energy region, we get 

+ {iRi ^ <^Rt) {z ~ Cm) ] ^ {z ~ Cm) , 

9miz) — [ z^Ri + {lR£ - OLRi) (z ~ Cm) ] ^ \f^m , 

A^(^) = [ ^Rt + {lR£ - oim) {z — Cm) ] ^ [iRt ~ <^Re) , (14) 

where Cm^ ^Ri and 'ym are the LMTO-ASA potential parameters [4,17]. The 

insertion of (14) with am = 0 into (5, 6) yields (in a matrix notation): 

G{z) = {z - H)-^ , H = C + y/AS° VA , (15) 

which means that the physical Green function is the resolvent of the second-order 
LMTO-ASA Hamiltonian H . It should be noted that the energy linearization 
of the LMTO method, which leads to (14) and to the Hamiltonian (15), is not 
of central importance for the Green funetion techniques discussed here, as they 
require matrix inversions rather than matrix diagonalizations. 

Finally, let us sketch briefly the evaluation of basic physical observables. As 
a rule, they are direetly related to a limit of the one-eleetron Green function 
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G{r,r']z) with respect to the upper complex halfplane, z — E + where E 
denotes a real energy variable. At zero temperature, the electronic charge density 
QH,{r) inside the J?-th atomic sphere (with spin index ignored) can be written 
as 



QR\r) 



1 

7T 



/ ap 

Im G[r + R,r -\- R] E + \0) dE 

■OO 

/ Ep 

VRL{r,E) nR,LL'{E) ifRL'{r,E) dE , (16) 

-OO 

where Ep is the Fermi energy. The quantity rift^LL>{E) is the local density of 
states matrix which is given in terms of the site-diagonal block of the physical 
Green function 

nR,LL'{E) = Im Ghl.hl' (-£■ + iO) (17) 

7T 

and which is closely related to J?L-projected and local densities of states 

nRL{E) = nR^LL{E) , nR{E) = ^ nRp{E) . (18) 

L 

The total integrated density of states, N{E), can be obtained from (8, 9, 17, 
18). In a matrix notation, the result can be written as [21] 

j-E 

N{E) = (e) ds 

T^T J — OO 

Tr log 5 “(i? + i0) + log M^(i? + iO)j , (19) 



RL 



— — Im 

7T 



RL 



where the symbol Tr means the trace over the composed J?L-index. Note that it 
is the auxiliary (KKR-ASA) Green function which appears in the expression for 
the integrated density of states N{E), in contrast to the physical Green functions 
entering the densities of states ur{E) and the charge densities QR{r). Despite 
this fact, N{E) (19) is representation-invariant as can be easily shown. 



3 The Coherent Potential Approximation 

Let us now consider the simplest model of a substitutionally disordered al- 
loy. We assume several components (atomic species) labeled by a superscript 
Q {Q = A,B, . . .) which occupy randomly the sites J2 of a given rigid lattice 
with probabilities satisfying the conditions 

E 4 = 1 . 

Q 



( 20 ) 
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We neglect completely any correlations of occupations of different sites and as- 
sume that the one-electron potential inside the J?-th atomic sphere (and conse- 
quently the related quantities like the potential functions P^{z)) depends solely 
on the occupation of this site. In order to express this model in a formal way, we 
introduce the random occupation index which takes on two values: = 1 

if an atom of the species Q is at the site R, and 77 ^ = 0 otherwise. Each con- 
figuration of the disordered alloy is thus uniquely specified by these occupation 
indices which obey the following trivial relations: 

= 1 > 5 ^^' , ( 21 ) 

Q 

which reflect the fact that a given site R cannot be empty or occupied by two 
different species simultaneously. Let us denote the configurational average of an 
arbitrary quantity as (...), then we get 

{vi) = = c%5rr> + c%c%, (1 - W) , (22) 

where the second equation expresses the absence of correlations of the site occu- 
pations. The random potential functions Pr£{z) can be then written in a form 

Pmi^) = ( 23 ) 

Q 

where P^^ (z) denotes the non-random potential function of the atom Q occupy- 
ing the site R. Equation (23) represents an important assumption of the model; 
analogous relations are valid between the random quantities A^^(z), I^R^iz) ( 8 ) 
and their Q-dependent non-random counterparts A^^(z), Let us fur- 

ther assume that the screening constants aR£ are non-random (configuration- 
independent). This implies that the structure constant matrix 5"^^ is non- 
random. The basic problem is an (approximate) configurational averaging of the 
various one-electron quantities introduced in Sect. 2. In the following, we use a 
simplified notation with omitted angular momentum indices L, L' so that ma- 
trix quantities Xrl^r>li will be abbreviated as Xr r/ (e.g., X — S°‘ , g°^{z)), 
while local (site-diagonal) quantities Wr^ll' will be abbreviated by Wr (e.g., 
W = 

We start with the auxiliary Green function g’^ (z) . Its configurational aver- 

age R’{z) can be formally written in a form (cf. ( 8 )) 

= rR.R'{^) = ( 24 ) 

which is nothing but an implicit definition of a non-random matrix quantity 
Pr r' {P) ^ the so-called coherent potential function. The complete knowledge of 
the latter is equivalent to an exact configurational averaging in (24). Approxi- 
mate alloy theories like the virtual crystal approximation, the average t-matrix 
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approximation, and the single-site CPA are based on the neglect of all site non- 
diagonal blocks of 

= (25) 

This assumption leads to a natural interpretation of the coherent potential fun- 
ctions: (z) describes the scattering properties of an effective atom at the 

lattice site R. The average Green function (24) corresponds then to a non- 
random solid formed by the effective atoms placed at the rigid lattice sites. 

There are several ways of introducing the single-site CPA [2,7,8], Here we 
present the approach of [14,15]. The unknown coherent potential functions V^{z) 
are determined in the following manner. Besides the solid with the effective 
atoms at all lattice sites, we consider a case with a particular site R occupied 
by a specified component Q while all other sites are occupied by the effective 
atoms. The auxiliary Green function in the former case is g°‘{z) (24), whereas 
that in the latter case will be denoted by Since the two systems differ 

only by a perturbation P^’ ^(^) - ^S(^) which is localized on a single site, the 
two Green functions are related by 

5RgH"^(^) = 9r',r"{z) - 9r',r{z) t’^^{z) gR^Rn{z) ■ (26) 

The quantity t'^'^Li{z) is the single-site t-matrix describing the scattering due 
to a Q-impurity in an effective medium formed by the effective atoms. It is 
explicitly given by 

^^Q(^) ^ f^^{z)[p^’^{z)-VUz) 

= 'P^^{z)-VUz)]f^^{z) , (27) 

where 

/“’^(z) = {l+ [P^^{z)-VUz)]rR,R{^)Y' ^ 

/“’«(z) = {l + g^,«(z) [p“’«(z) - VUz)] • (28) 

The CPA condition for the coherent potential functions can be now formulated 
as 

E = 9r'M^)^ (29) 

Q 

which expresses the equivalence of the average Green function g“(z) and a 
concentration-weighted sum of the Green functions g’^’^^^\z), see Fig. 1. 

As it is obvious from (26), the relation (29) is equivalent to 

E4ts^(z) = 0 , 

Q 



( 30 ) 
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Fig. 1. The selfconsistency condition of the CPA 



which is a condition for vanishing average scattering from the Q-impurities {Q — 
A, B, . . .) embedded in the effective medium. 

Equation (30) represents the standard form of the CPA selfconsistency con- 
dition [2,7,8] which specifies implicitly the coherent potential functions 'P’^(z). 
It should be noted that {z) are in general non-diagonal matrices in the 

L, L' indices, in contrast to the potential functions of the individual components 
~ ^LL’)- The CPA condition (30) has to be solved for all si- 

tes simultaneously as the single-site t-matrices (27, 28) involve the site-diagonal 
blocks of the full matrix inversion defining the average Green function (24). In 
practice, this can be done only if the whole lattice can be represented by a finite 
number of inequivalent sites. In the case of a bulk alloy with a crystal lattice and 
with a possible long-range order, the lattice sites can be written a,s R — B + T 
(see the text before (11)), where B labels the inequivalent sites, and the alloy is 
specified by the concentrations and the component-dependent potential fun- 
ctions {z) . As a consequence, the coherent potential functions for all lattice 
sites reduce to a finite set of matrix quantities Pb LL'i^)- analogy to (12), 
the lattice Fourier transform of the average auxiliary Green function is given by 

rBL.B'L'{k,z) = {[V^{z)-S»{k)]-^} (31) 

y 13 L/ .13' L/' 

where the matrix S°^{k) is given by (11) and V^{z) denotes a matrix of the cohe- 
rent potential functions of the inequivalent sites, Pbl b'L'{^) ~ Pb ^bb' ■ 

A subsequent BZ-integration yields the elements of the average auxiliary Green 
function (cf. (13)) 

9bl,{B'+t)L'{^) ~ 9BL,B'L'{k, z) exp(— ifc-T) . (32) 

k 

It should be noted that only the site-diagonal blocks {B — B' , T = 0) of g'^{z) 
enter the CPA selfconsistency condition (30). The appearance of fc-dependent 
quantities in the description of random substitutional alloys reflects a well-known 
fact that the configurational averaging restores the translational symmetry (ab- 
sent for individual configurations of the alloy). 

Despite the fact that the CPA condition (30) represents a set of coupled non- 
linear equations for the complex matrix quantities Pr general theorems 
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guarantee the existence of its unique solution which possesses the so-called Her- 
glotz property. The latter means that (i) the coherent potential functions are 
analytic functions of 2 ; outside the real energy axis, and (ii) the imaginary part 
of the matrix V^{z) is positive (negative) definite for Imz > 0 (Im 2 ; <0). 

3.1 Site-Diagonal Quantities 

The calculation of average local quantities like the charge densities (16) or the 
local densities of states (18) requires a knowledge of additional quantities besides 
the site-diagonal blocks of the average Green function (24). One introduces so- 
called conditionally averaged local auxiliary Green functions g‘^'%iz) defined 
by 

Sr%{z) = (cr) {vr9r,r{z)'^ ■ (33) 

This quantity corresponds to the site-diagonal ( J2, J2)-th block of the Green 
function averaged under the condition that the site R is occupied by the atomic 
species Q. Within the CPA, g^%{z) is equal to the ( J?, J?)-th block of the 
Green function g°‘’^^^\z) (26) corresponding to an J?Q-impurity in the effective 
medium: 

9r%[z) = gR,R{z) - gR^R[z) g%^R{z) . (34) 

An equivalent form of this result can be obtained with the help of (28): 

9 r%{z) = 9 r^r{z) f'^'^ {z) = fR'^{z)gR^R{z). (35) 

It follows immediately from (34, 35) that the CPA selfconsistency condition (30) 
can be expressed in two other forms, namely, 

^c%g‘^%{z) = g%^R{z) , (36) 

Q 

and 

= E 4 = 1- (37) 

Q Q 

The first of them can be easily interpreted: the concentration-weighted average 
of the Q-dependent conditionally averaged local Green functions is equal to the 
site-diagonal block of the average Green function. 

Let us now discuss the averaging of local observables. We define the condi- 
tionally averaged local physical Green functions (5^ j^{z) as 

= (c^) (^VRGR,R{z)'j . (38) 




Disordered Alloys 359 



Taking into account (9) and the simple configuration dependence of P^g{z), 
A^(z), and lP^{z) (23), we obtain finally 

^RL,RL'{^) — ^Rli^)^LL' + 9 rl,RL'(^) ^^R?i^) ■ (^ 9 ) 

The expressions (16, 17, 18) can be modified to get the Q-resolved average 
quantities: the local density of states matrix 

^H,LL'(-^) ~ ~ ~ I™ ^RL,RL'(-^ + iO) ’ (4^) 

the densities of states 

^Rl{E) > ( 41 ) 

L 

and the charge densities 

/ Ep 

‘/^rl(^>-^) ^h,ll'(-®) ‘T’rl'(^)-^) ■ (42) 

-oo 

One can also define average local quantities as concentration-weighted sums of 
the corresponding Q-resolved quantities, e.g., 

riRL{E) = nR{E) = c%^n%^{E) , (43) 

Q Q 



which dehne average densities of states. 

The CPA expression for the configuration average of the total integrated 
density of states N{E) is not a simple generalization of (19). The final result is 
given by [2,14,22] 



N{E) 




TT 



Tr log r{E + iO) + 51 4tr log f‘^^{E + iO) 
RQ 



RQL 



(44) 



where the symbol tr means the trace over the angular momentum index L. Let 
us mention an important variational property of N[E) (44), which is a direct 
consequence of the CPA selfconsistency [22]: N{E) is stationary with respect 
to variations of the coherent potential functions 5V^{z). This property makes 
the CPA an excellent starting point for studies of alloy energetics within the 
generalized perturbation method [23,24]. 

In numerieal implementations of the CPA as well as of other Green function 
techniques, complex energies are indispensable to obtain the limiting values at 
the real energy axis, cf. (40, 44). A useful approach to get the necessary limits 
F[E + i0) of a complex function F{z) analytic in the upper halfplane is based on 
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a relatively easy evaluation of the function F{z) for Im z > 0 and a subsequent 
analytic continuation to the real axis [25]. This procedure is justified by the 
Riemann-Cauchy relations and it employs truncated Taylor expansions of the 
function F{z). Suppose that F{z) has to be evaluated for real energies on a 
dense equidistant mesh of energy points En — Eq + nh, where h is an energy 
step and n = 0, 1, . . . , A^. Let us consider a discrete set of complex energy points 



Zn,m — Eq + uh + im/i, (45) 

where m — 0,1,..., M, and let us abbreviate Fn^m — F{zn,m)- The first step 
of a continuation procedure is the calculation of Fn^M for —M < n < N + M , 
i.e., for complex energies along a line parallel to the real axis. In each of the M 
following steps, the values of Fn,m with m reduced by one are obtained from all 
previously calculated values. The simplest examples are given by relations: 

Fn^m—l ^ 2 Fn^m T ^ T ^ - ^71+1,771 ; 

which is based on a quadratic Taylor expansion, and 



F 



n,m—l 



Fn,m “ 1 “ 2 — - Fn+l,r. 



(47) 



which is based on a repeated linear Taylor expansion. There are many modifica- 
tions of this procedure which employ higher-order expansions [15,25]. However, 
only the linear continuation (47) yields always strictly non- negative densities 
of states and the Bloch spectral functions. Typically, an energy increment of 
A ~ 5 mRy and M ~ 2 to 5 lead to a sufficiently large Im z for an initial calcu- 
lation of F{z). The continuation to the real axis according to (46, 47) represents 
then a negligible computational effort. 

An example of average densities of states is presented in Fig. 2 for a spin- 
polarized random bcc Fco.tVq.s alloy. Due to the antiparallel magnetic moments 
of the Fe and V atoms, the spin-up electrons feel a much stronger disorder than 
the spin-down electrons. The different degree of disorder is nicely reflected in the 
shapes of the local densities of states: the spin-down densities for both compo- 
nents (Fig. 2b) resemble those for the pure elements in the bcc structure whereas 
the spin-up densities (Fig. 2a) are strongly modified due to alloying. Especially 
in the latter case, the CPA describes the electronic structure substantially bet- 
ter than other single-site theories (the virtual crystal approximation, the average 
t-matrix approximation) . 



3.2 Site Non-Diagonal Quantities 

Let us now turn to the physical Green function Gb,^r'{z) and to its configura- 
tional average 



{Gr,R'{z)) — Gr^r/{z) , 



( 48 ) 
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Energy (Ry) Energy (Ry) 

Fig. 2. Spin-polarized local densities of states for Fe {full lines) and V {dotted lines) 
atoms in the random bee Feo.rVo.a alloy; (a) spin-up electrons, (b) spin-down electrons. 
The vertical lines denote the position of the Fermi energy 



and let us treat separately its site-diagonal {R = R') and site non-diagonal 
{R / R) blocks. The former are given directly by 

Gr,r{z) = ^'r,r{^) 1 ( 49 ) 

Q 

where the conditionally averaged blocks G% r{z) can be expressed according to 
(39). The site non-diagonal blocks can be rewritten with the help of (9, 23) as 

Gr^rGz) = {vr9r,r'{z)Vr') 9r? (z) ■ (50) 

QQ' 

The configurational average on the r.h.s. of (50) represents (apart from a nor- 
malization) a more complicated case of a conditional average: it refers to the site 
non-diagonal {R, J?')-th block of the auxiliary Green function averaged under 
the condition that the two sites R, R! are occupied by the atomic species Q,Q' , 
respectively. The single-site CPA expression for this kind of conditional average 
is [10,11] 

Vr9r,r'{z) = c% f‘^^{z) 9r,r'{z) c% {z) . (51) 

Equation (51) can be derived for binary alloys by means of a simple algebraic 
technique [14] while for the multicomponent case one can use Green functions 
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in an extended space [2,9]: 




A single-site CPA averaging of (52) yields then the result (51) [2]. The final 
expression for the average physical Green function follows from (49, 50, 51) and 
can be compactly written as 

Gr,r'{z) = Gr^r{z)Srr' + M^{z) g%ji,{z) M%{z) {1 - 6rr>) 

— ^RR' + ^r(^) 9r,r'{z) j (53) 

where 

fR^ (z) {z) , 

Q 

Mr{z) = ( 54 ) 

Q 

and 

^1r{z) — Gr^r{z) ~ M'^{z) gR^R{z) M%^{z) . (55) 

It should be noted that the final relation between Gr,r>{z) and gRRt{z) (53) 
bears the same formal structure as (9) for the non-averaged Green functions. 

The average physical Green function (53) can be now used to calculate the 
Bloch spectral functions. Let us consider again the case of a random bulk alloy 
with a crystal lattice and with a possible long-range order. The lattice sites can 
be written as R — B + T, where B labels the inequivalent sites and T runs over 
the translation vectors of the configurationally averaged system (see the text 
near (31, 32)). The Bloch spectral functions are defined in terms of the lattice 
Fourier transform of Gr^r'{z): 

ABL{k,E) = Im GBL,BL{k,E + \Q) , 

7T 

AB{k,E) = E ABL{k,E) . (56) 

L 

As follows from (53), the lattice Fourier transform of Gr_r'{z) can be reduced 
to that of R’{z) which in turn is given by (31): 

GBL,BL{k,z) = E ■^B,LL'{z) 9 bL' ML"{ k, z) M% ]^„]^{z) 

L'L" 

+ ^b,ll(^) • (57) 

Using (53) and elementary properties of lattice Fourier transformations, one 
can prove a relation between the Bloch spectral function ABh{k, E) and the 
corresponding average density of states tibl{E) (43), namely, 

nBL{E) = ^ E ABL{k,E) . 



(58) 
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According to this sum rule, the Bloch spectral function reflects the contributions 
of different parts of the BZ to the resulting density of states of the configura- 
tionally averaged system. Let us note that in the case of non-random crystalline 
solids, the spectral functions for a given fixed fe-vector reduce to sums of S- 
functions located at the corresponding energy eigenvalues. Hence, the concept 
of the Bloch spectral functions substitutes energy bands in random alloys and 
can be used, c.g., for a definition of the Fermi surfaces. The latter are based on 
the fc-dependence of the spectral functions (56) evaluated at a constant energy 
{E = Ef). 



3.3 Transformation Properties of the LMTO-CPA 

The physical properties of a non-random system described by the TB-LMTO- 
ASA method do not depend on the choice of a particular LMTO representation 
a as expressed by (5, 9). In the context of random alloys, it is of fundamental 
importance to know whether this feature survives the approximate configuration 
averaging within the single-site CPA. The answer is positive [15] as will be shown 
below. 

We assume that the representations a, (3 are specified by non-random scre- 
ening constants respectively. For simplicity, we will omit the energy 

arguments as well as the angular momentum indices L, L' . The transformation 
of the coherent potential functions is analogous to (7), namely, 

[1 + • (59) 

The transformations of the coherent potential functions (59) and of the non- 
random structure constants (7) lead to the following transformation of the aver- 
age auxiliary Green functions: 

9rm' = ^ “«) '^R ^RR' 

+ [v^r) n rR.R- {K') ' ( 60 ) 



which is of the same structure as (10). The transformation of the perturbation 
related to a single Q-impurity embedded in the effective medium is given by 



p/3iQ 

Rr 




p0^Q 

Rr 








R ’ 



(61) 



as can be easily derived from (7, 59). The transformation of the quantities (28) 
can be obtained with the help of (61) and of the site-diagonal blocks of (60). 
The result is 



f0,Q 

JR 

W,Q 

JR 



K (n) 



-1 



ra,Q 

JR 



P, 



R 






R 



o0,Q 

R 



-1 



pa,Q ^payl ^ 



(62) 
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which can be combined with (61) to get the transformation of the single-site 
t-matrices (27): 

(P^)-' vi . (63) 



An immediate consequence of (63) is the simultaneous validity of the CPA 
selfconsistency condition (30) in two different LMTO representations. This me- 
ans that all CPA effective media are mutually equivalent irrespective of the 
particular LMTO representation used for the formulation and solution of the 
selfconsistency condition. 

The transformations of other quantities can be derived from (59-63). This 
yields, e.g., for the conditionally averaged local auxiliary Green functions (35) a 
relation completely analogous to (10): 




{Pm — cum) 



P\ 



a,Qi 



m 



z) 






p\ 



a,Q , 



m 

1,0, Q 



f) 

P^m^{^) 

9rl,rl'{P) 



OLL' 



P\ 



a,Q 

m.' ' 



Z) 



P^Jiz) 



(64) 



One can further show that CPA averages of the physical Green functions (38, 48) 
remain invariant with respect to different LMTO representations a, which in turn 
implies the invariance of all physical observables and proves a full compatibility 
of the single-site CPA with the TB-LMTO method. 



3.4 Solution of the CPA Selfconsistency 

It should be noted that although the above three forms of the CPA condition (30, 
36, 37) arc mathematically equivalent, not all of them arc suitable for numerical 
applications. For this purpose we will introduce the so-called coherent interactor 
^RLL'{P) [2,9,14] which is a local quantity defined implicitly in terms of the 
coherent potential function and the site-diagonal block of the average auxiliary 
Green function as 



9r,r{^) — [Pr{z] ^ ^r{z)] 



(65) 



or, explicitly, as 

^r(^) — Pr{P) ^ [9r,r{z)] ■ ( 66 ) 



The coherent interactor describes the effective coupling of a given site R to all 
other sites in the system. Using this definition, one can express the conditionally 
averaged local auxiliary Green function as 



-a.Q ( \ 
9r,r{P) 




(67) 



In the following, we describe a simple iterative scheme solving the CPA condition 
(36) using the coherent interactor and (65, 67). We assume that the energy z lies 
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outside the real energy axis. For brevity, the energy argument x and the orbital 
indices L, L' will be omitted. 

The algorithm starts from an input value which can be set either to 

zero or, e.g., to the converged coherent interactor for a neighboring energy argu- 
ment. For a particular iteration leading from to the new value 

the procedure consists of three steps. First, the coherent potential function 
at each site R is set up in terms of and the potential functions and 

concentrations of all components Q according to the relation 

= E > (68) 

Q 

or, explicitly, 

= |e 4 I ' + 

Q ^ 

Second, these coherent potential functions are used to calculate the site-diagonal 
blocks of the average auxiliary Green function 

Sr%^ = ( 70 ) 

Third, the new value of the coherent interactor ^ at each site R is obtai- 

ned from the relation 

>“’<“> - «“'<“+■']"' = sj'”> , (71) 

or, explicitly, 

/?“’(”+!) = • (72) 

These three steps have to be repeated in order to obtain converged quantities 
and at all sites. Steps (69, 70, 72) preserve the Herglotz property of the 
matrix quantities 1?^, V^, Convergence is achieved typically after 5 to 20 

iterations depending on the alloy system and the complex energy variable. 

Substantial acceleration of charge selfconsistent calculations for random sy- 
stems can be achieved by repeated alternation of one CPA iteration and one up- 
date of one-electron potentials (see Sect. 5). In such case, the potential functions 
of all alloy components in (68, 69) are replaced by the n-dependent quantities 
p<^Q,{n) update of the one-electron potentials and the potential functions 
follows the CPA iteration (69, 70, 72) and is based on charge densities derived 
from the conditionally averaged local auxiliary Green functions 




(73) 



In this way, the CPA selfconsistency is obtained simultaneously with the LSDA 
selfconsistency. 
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4 Surfaces and Interfaces 



Applications of the single-site CPA to layered systems on lattices with two- 
dimensional (2D) translational symmetry require special approaches to calculate 
the Green function quantities involved. Below we summarize the most essential 
relations of a technique based on the concept of principal layers and the surface 
Green functions [26-28], 

The approach rests on the use of the tight-binding LMTO representation 
P which provides the most localized structure constants 5'^^ [16,17], and 

on the representation invariance of the CPA (Sect. 3.3). The finite range of the 
tight-binding structure constants allows to introduce the principal layers in such 
a way that (i) each principal layer consists of a finite number of neighboring 
atomic layers, (ii) the whole lattice can be considered as a stacking of an infinite 
sequence of the principal layers labeled by an integer index p, see Fig. 3, and (hi) 
the structure constants couple only the neighboring principal layers. 

The sites J? of a given system can be then written in a form R = (p, B,T||), 
where p is the index of the principal layer, B denotes the corresponding basis 
vector (mostly an atomic layer) in the p-th principal layer, and T|| is a 2D 
translation vector such that J? = B + T|| . We assume for simplicity that each 
principal layer contains the same number ne of the basis vectors B. 
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Fig. 3. Principal layers for a single interface of two semi-infinite systems 



As a consequence of the 2D translational symmetry of the lattice, a 2D lattice 
Fourier transformation of the structure constant matrix leads to a fe|| -dependent 
matrix 



^pBL,p'B'L'(^\\) — X/ ^pBL,p'(B'+T^^)L' ®xp(i ^H' ^|| ) i C74) 

where fc|| denotes a vector in the 2D BZ. It should be noted that the tight- 
binding structure constants (74) vanish for |p — p'| > 1, i.e., they form a block 
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tridiagonal matrix with respect to the principal-layer index. In the following, we 
will often omit the composed matrix index BL so that matrix quantities with 
elements XpBL,p'B'L' and Wp^BL,B'L' will be respectively abbreviated as Xp^p> 
and Wp. The dimension of the latter matrices is equal to nB(^max + 1)^, where 
^max denotes the angular-momentum cutoff. 

Let us consider the case of a single interface of two semi-infinite systems. Ex- 
amples of this situation are: a surface of a bulk alloy (solid- vacuum interface), an 
epitaxial interface of two alloys (metals), a special grain boundary in a bulk me- 
tal, etc. The treatment of all these cases can be greatly simplified due to the fact 
that all inhomogeneities are confined to an intermediate region of a finite thickn- 
ess (principal layers 1 < p < M) placed between two semi-infinite substrates, sec 
Fig. 3. The electronic properties (e.g., the coherent potential functions) of both 
unperturbed substrates are supposed to be known and the main interest then 
concentrates on the intermediate region. Let us assume that the configuration- 
independent properties of the layered alloy system exhibit the 2D translational 
symmetry of the underlying lattice. As a consequence, the coherent potential 
functions for all lattice sites reduce to pB-resolved quantities VpB which 

form matrices V!^{z) with elements Vp bl b'L'(^) ~ '^pB ^bb'- The aver- 

age auxiliary Green function (24) can be then calculated using the corresponding 
lattice Fourier transform gpBL p’B'L'i^W^ 

The layer-diagonal (p = p') blocks of the latter can be expressed as 

9p,p{k\\^z) = V^{z) - 5^,p(fei|) 

- rp^’<m,z) - rp^’>m,z)Y' , (75) 

where the the first two terms in the bracket correspond to the isolated p-th layer 
while the so-called embedding potentials z) and z) reflect the 

influence of the two semi-infinite parts adjacent to the p-th principal layer - 
the superscript < (>) refers to the part consisting of all principal layers p' < p 
{p’ > p). The embedding potentials for layers inside the intermediate region 
(1 < p < M) can be calculated from recursion relations 

r^^><(fe,|,^) = <,_i(fc||) [v^p.M - ^p^q,_i(fe||) 

-rp^>(fcl|,z)]”'5p^+i^^(fell), (76) 

and from the starting values 

Tf'<(fc,|,z) = 5fo(fc||)gf,ft(fe,|,z)Ao^,i(fe||), 

~ *^M,M + l(^l|) ^right(^|b ’^M+1 ,m(^||) ■ 



(77) 
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The matrix quantities &ught(^ll > C^'^) surface Green 

functions of the two semi-infinite substrates sandwiching the intermediate region. 
The calculation of the layer-diagonal blocks of the Green function according to 
(75, 76) is obviously an order-M procedure. 

The surface Green function (SGF) is defined as a projection of the full Green 
function of a semi-infinite layered system onto its outer principal layer. For a 
semi-infinite system consisting of identical principal layers, one can apply the 
concept of removal invariance [2] to derive a closed condition for the SGF which 
reflects the true semi-infinite geometry of the system. In the case of the left 
substrate in Fig. 3, this condition is 

- GLik\\,z) 5^i.o(fe||)]^' > (78) 

whereas an analogous condition for the right substrate is omitted here for brevity. 
Both conditions are of the same form, namely 

g = {D - AQB)-^ , (79) 

where Q is the SGF and where the fe||- and 2 ;-arguments were suppressed. The 
most direct method to solve (79) is based on simple iterations [28] 

g(n+l) ^ (D - B)-^ (80) 

starting from an input value which can be set either to zero or, e.g., to 
the converged SGF for a neighboring energy argument. The latter choice of 
g^^'^ substantially reduces the number of necessary iteration steps, especially for 
complex energies close to the real axis. The iterative procedure (80) is easy to 
implement, leads always to the correct solution of (79) satisfying the Herglotz 
property, and has a direct physical meaning: g^P with the initial value = 0 
corresponds to the SGF of a stacking of n identical principal layers. The number 
of steps to get a converged SGF depends on the imaginary part of the complex 
energy z, but in most applications several tens of iterations are sufficient. In 
the cases where an enhanced accuracy of the SGF and/or very small Im 2 (less 
than 10 niRy) are needed, the SGF can be more efficiently obtained by means of 
the renormalization-decimation technique [15,18,29]. The high efficiency of the 
latter method is due to an exponential increase of the thickness of an effective 
layer with the number of iterations, in contrast to the linear increase inherent 
to the simple procedure (80). 

For evaluation of local physical observables as well as for the solution of the 
CPA condition, the site-diagonal blocks of the average auxiliary Green function 
are of central importance (see Sect. 3). They can be obtained by a 2D BZ- 
integration of (75) as 




( 81 ) 
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where A^|| is the number of fc|| -points sampling the 2D BZ. The layer-diagonal fc||- 
dependent average auxiliary Green functions (75) enter also the corresponding 
Bloch spectral functions (fc|| -resolved densities of states). They are defined in 
analogy to (56) as 

•ApBL{f^\\,E) = Im GpBL,pBL{f^\\, E + iO) , (82) 

7T 

where the lattice Fourier transform of the average physical Green function is 
given by (cf. (57)) 

GpBL,pBL{k\\, z) = E ■^pB,LL'(^) 9pBL',pBL"i^h^) ■^pB,L"l(^) 

L'L" 

+ ^pB,ll(^) ■ (^'^) 

It should be noted that the Bloch spectral functions (82) represent a suitable 
tool to study surface/interface states in disordered as well as ordered layered 
systems. 

Figure 4 shows the local densities of states at the (001) surface of a random 
non-magnetic bcc Feo.isVo.ss alloy. One can clearly see a rapid convergence of 
the layer-resolved densities to their bulk counterparts, which justifies numerically 
the concept of the intermediate region of a finite thickness. The bands in the 
top surface layer are narrower than the bulk ones and the pronounced minima 
in the middle of the bcc bulk bands are absent in the top surface layer. These 
effects can be ascribed to the reduced coordination of the surface atoms. As 
a consequence, both components exhibit a strong enhancement of the surface 
densities of states at the Fermi energy (see Fig. 4) which in turn can induce a 
surface magnetic instability of the non-magnetic bulk alloy. 

5 Charge Selfconsistency for Random Alloys 

The LSDA selfconsistency for substitutionally disordered systems within the 
GPA and the ASA is based on the average component-resolved charge densities 
(42). In the following formulas, we use atomic Rydberg units (e^ = 2) and 
assume a spin-polarized non-relativistic system with a collinear spin structure. 
The spin-dependent charge densities inside the individual atomic spheres will 
be denoted ^^^.(r) where ct = t, j, is the spin index. Related quantities are the 
spherically averaged spin-dependent densities 

and the total electronic charge densities 




370 



Ilja Turek et al. 




- 0.6 - 0.3 0 0.3 



Energy (Ry) 



Fig. 4. Layer-resolved local densities of 
states for Fe (full lines) and V {dotted 
lines) atoms at the (001) surface of the 
random bcc Feo.isVo.ss alloy. The top 
four layers and the bulk layer are deno- 
ted by S, S-1, S-2, S-3, and bulk, res- 
pectively. The vertical line denotes the 
position of the Fermi energy 



The selfconsistcnt onc-elcctron component-dependent ASA potentials arc then 
given by 



V^^ir) ^ -2Z%r-^ + I 2 ~g%{r') \r - 

■HR) 

+ + VMad,Ks , (86) 

where the integration is carried out over the J?-th atomic sphere. The first term 
in (86) is the Coulomb potential due to the point-like nuclear charge Z'^, the 
second term is the Hartree potential due to the spherically symmetric charge 
density g%{r), the third term represents the exchange-correlation contribution, 
and the last term is the Madelung contribution. The exchange-correlation term 
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is evaluated according to a standard relation 

d 

Kc.<7(£<t> e’i) = ^ [(£<t + £’;) £xc(eT,et)] , (87) 

where £xc(£'t> Pt) i® exchange-correlation energy per particle of a spin-polarized 
homogeneous electron gas. The Madelung contribution VMad.Jis in (86) is a spe- 
cial case (for L = (£, m) = (0, 0)) of the multipole Madelung terms defined 

by 

TlvIad.fiL = Mrl.R'L' QR' L' , (88) 

R'L' 

where the primed sum indicates exclusion of the term J2' = R. The constants 
Mrl r'l' in (88) describe the electrostatic interactions between two multipoles 
located at the sites J?, R' and the quantities qRL are average multipole moments 
due to the total (electronic and nuclear) charge densities inside the atomic sphe- 
res, 

E Q Q 
Cr <1rl > 

Q 

Let us note that qjis and (£ = 0 in (89)) refer to the net charges inside 
the J?-th sphere. The summations in (88) for infinite lattices with two- or three- 
dimensional translational symmetry can be performed using the corresponding 
Ewald techniques [15,18,30]. For bulk systems the Madelung contribution (88) is 
often calculated only from the net charges qns , whereas for surfaces an inclusion 
of the dipole moments is inevitable, e.g., for a good description of the surface 
dipole barrier and the work function [30]. 

For calculations of charge densities, the energy dependence of the regular 
radial amplitude in (3) is replaced by a truncated Taylor expansion at an energy 
^uRia ^be center of the occupied part of the valence band 

+ \^%eAr){E-E%S ^ ( 90 ) 

which results in a simple expression for the spherically averaged charge density 

(84) 
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In (91) the last term denotes the core contribution while the quantities 

(fc = 0, 1, 2) represent the lowest energy moments of the Q-R£cr-projected valence 

densities of states (41) 



m 



Q,k 

Ria 




m=—i 



(92) 



where Eb denotes the bottom of the valence band. Similarly, the multipole 
moments (89) reduce to several radial and energy integrations [15,30]. The 
latter are of the type {k, k' — 0, 1, 2) 



m 



Q, kk' 

R, LL',a 




E. 



Q 

iy,Ricr 



\k Q 

) nr 



R,LL',g 



[E) {E - E. 



Q 

i/,R£.' a 



fdE 



(93) 



representing thus the lowest energy moments of the local density of states matrix 
’^R,LL' ,aiE) (40). 

As follows from (40), the energy integrals (92, 93) over the occupied part of 
the valence spectrum can be generally formulated as 



1 



7T 




Im F{E + iO) dE 



1 

27ri 



ic 



F{z) dz . 



(94) 



The function F[z) is an analytic function of the complex energy variable 2 
(except at poles and/or branch cuts lying on the real energy axis) which satisfies 
F{z*) = F*{z). The r.h.s. integral in (94) is taken along a closed contour C 
intersecting the real energy axis at the Fermi level and enclosing the occupied 
valence band. Standard quadrature techniques lead to an approximation 



1 

27ri 



'C 



F{z) dz 



Re 



N 

E F{Zn) 

n=l 



(95) 



which replaces the original integral along the real axis by a finite sum with N 
complex weights Wn and nodes G C. All nodes Zn can be chosen in the upper 
(or the lower) complex halfplane. They are usually taken along a semicircle 
contour with a denser mesh near the Fermi energy. Experience shows that a 
relatively modest number of nodes {N ~ 10 to 20) is sufficient to achieve desired 
accuracy in most charge selfconsistent calculations. Minor complications arise in 
selfconsistent bulk calculations in which the Fermi energy Ep is unknown and 
changes in each iteration (contrary to the case of surfaces where the value of 
Ep is fixed from a previous calculation of the bulk substrate). Fortunately, it is 
not necessary to locate the bulk Fermi level exactly in each iteration but merely 
to update its value so that the convergence of Ep proceeds simultaneously with 
that of the one-electron potentials. 

Iterative procedures leading to selfconsistent one-electron potentials (or charge 
densities) have been recently systematically accelerated by means of quasi-Newton 
methods (like the Anderson and the second Broyden mixing scheme) [31]. These 
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techniques are efhcient also for random alloy systems where the LSDA-CPA 
selfconsistency can be achieved by alternating updates of the one-electron po- 
tentials and the coherent interactors (Sect. 3.4). According to our experience, the 
full convergence in all-electron calculations can be obtained in 30 to 80 iterations 
for most systems. 

The total energy for non-random systems within the ASA [17,20,30] can 
be directly generalized to the case with substitutional randomness. The final 
formula is given by a concentration-weighted sum of JiQ-dependent terms 

^ = E 4 > (96) 

RQ 

where the individual contributions are explicitly given by 




+ 2 E ^Mad, HL • (97) 

L 

The first term in (97) is the sum of core eigenvalues labeled by j, while 

the second term represents an energy contribution due to the valence densities 
of states (41). It can be trivially expressed in terms of the moments 

( 92 ). 

Let us note that the above presented formulas for the one-electron potentials 
(86) and for the total-energy contributions (97) were derived under a complete 
neglect of any correlations (i) between the occupation of a particular site R and 
the charge densities inside the other atomic spheres, and (ii) between the charge 
densities inside different atomic spheres. These neglected correlations result then 
in the component-independent Madelung terms VMad,RL (88) due to the average 
multipole moments qwL' (89). This simple treatment is fully compatible with 
the mean-field nature of the single-site CPA. 

However, it has been found in a number of applications of the CPA that 
the neglected charge correlations lead to substantial errors in the calculated 
total energies. Several schemes were suggested to remove this drawback. Let us 
consider for simplicity only the case of random binary bcc or fee alloys. The 
condition of the overall charge neutrality together with the neglect of higher 
multipole moments leads to a vanishing mean-field Madelung contribution pMad.s 
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to the one-electron potential (the site index R is omitted). The screened CPA [32] 
and the screened impurity model [33] lead to a component-dependent Madelung 
term 

^Mad,. = - 2 g? d-^ , (98) 

where dnn denotes the distance between the nearest neighboring sites of the 
lattice. This shift of the one-clectron potentials follows from an assumption of a 
perfect screening of the net charge qf by compensating charges located on the 
nearest neighbors. The correction to the total alloy energy per lattice site is then 
given by 

= /5 (99) 

Q 

where the prefactor (3 equals 1/2 for the screened CPA [32,34] whereas for 
the screened impurity model the whole interval 1/2 < /3 < 1 was considered 
[33,35,36]. Another approach employs an idea of neutral atomic spheres [37,38] 
where the sphere radii are changed (keeping the average atomic volume fi- 
xed) to achieve vanishing net charges (g^ = 0) for both alloy components. All of 
these schemes improve considerably the calculated total energies for many alloy 
systems but a detailed assessment of their validity especially for alloy surfaces 
remains yet to be done. 

6 Extensions and Applications of the LMTO-CPA 

The non-relativistic TB-LMTO-CPA theory of substitutionally disordered al- 
loys can be generaliiied to include properly all relativistic effects based on the 
Dirac equation. The relativistic theory in the non-magnetic case represents a 
straightforward modification of the non-relativistic counterpart [15,39] whereas 
for spin-polarized systems certain theoretical as well as technical problems ap- 
pear [15,40]. Nevertheless, many of the theoretical concepts introduced above 
remain valid. 

The energetics of metallic alloys and their surfaces with applications to orde- 
ring and segregation phenomena is usually studied in terms of effective interato- 
mic interactions. They can be determined from ab initio electronic structure cal- 
culations using either the generalized perturbation method [23] or the Connolly- 
Williams inversion scheme [41]. In the context of the LMTO-CPA theory, the 
generalized perturbation method was described in [15,42] and reviewed in [43], 
while a modification of the Connolly-Williams approach was developed in [44] . 

Recent applications of the selfconsistent LMTO-CPA method cover a large 
area of the modern theory of alloys. The ground-state properties of non-magnetic 
bulk random alloys were investigated, e.g., in [33,35,38], while the Fermi surfaces 
and the electronic topological transitions were studied in [45]. Existing appli- 
cations to magnetic bulk alloys include studies of the local magnetic moments 
[40,46], various aspects of the Invar alloys [47,48], the structural stability [49,50], 
the ordering tendencies [49,51,52], and the Curie temperatures [43]. 
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The electronic structure of surfaces of random alloys was investigated, e.g., in 
[39,53-55], studies of the surface segregation were published in [42-44,54,56,57], 
and calculations of the surface magnetic properties of random alloys were pre- 
sented in [58,59]. 

Two-dimensional random alloys which can be formed at an epitaxial interface 
of two different metals represent another field of applicability of the LMTO-CPA 
method. The electronic structure of non-magnctic random overlayers on metallic 
substrates was calculated, e.g., in [55,60,61] while random magnetic overlayers 
on non-magnetic substrates were studied in [46,62,63]. The adlayer core-level 
shifts of random overlayers were calculated in [64], the ordering tendencies in 
surface non-magnetic alloys were analysed in [65], the interplay of magnetism 
and ordering was considered in [51], and the stability of metallic interfaces was 
investigated in [66]. 

The interlayer exchange coupling, encountered in epitaxial magnetic multi- 
layers, is another quantity which can be influenced by substitutional disorder 
both in the magnetic layers and in the non-magnetic spacer. Applications of 
the LMTO-CPA to this problem can be found in the review [67] and references 
therein. 

The formalism presented in this paper as well as the applications listed above 
are heavily based on the ASA. A development of a full-potential version of the 
LMTO-CPA is difficult due to the dependence of each LMTO on the occupation 
of all lattice sites. This complicated configuration dependence of the LMTO’s 
can be removed in the so-called pure-L approximation for the TB-LMTO’s [68] 
and the corresponding single-site CPA theory can be then derived. This was 
done in [69,70] together with applications to random fee Li-Al and Ni-Pt bulk 
alloys. 
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Abstract. A formulation of the order- A locally self-consistent Green’s function, LSGF, 
method in conjunction with the linear mufhn-tin orbital (LMTO) basis set is discussed. 
The method is particularly suitable for calculating the electronic structure of systems 
with an arbitrary distribution of atoms of different kinds on an underlying crystal 
lattice. We show that in the framework of the tight-binding representation it can be 
generalized to systems without ideal three-dimensional symmetry of the underlying 
lattice, like, for instance, alloys with local lattice relaxations or surface alloys. We also 
show that multipole corrections to the atomic sphere approximation can be easily in- 
corporated into the formalism. Thus, the method represents a powerful tool for studing 
different problems within alloy theory. 



1 Introduction 

Recent research in solid state physics has shown a number of encouraging results 
for the investigation of physical properties of metallic alloys. In particular, the 
computational schemes which allow one to treat ordered, as well as random al- 
loys, their surfaces and interfaces have been developed and applied with a great 
success. This has led to a much deepened understanding of the behavior of ther- 
modynamic and magnetic properties, structural and phase stabilities, impurity, 
surface and segregation energies through the transition metal series [1-5]. On 
the other hand, first-principles investigations arc still limited to certain ideal sy- 
stems, like, for instance, completely ordered or completely random alloys, while 
for materials and problems of technological importance these studies are quite 
rare. A possible improvement of this circumstance consists in the development of 
more efficient computational schemes, for example, methods that scale linearly 
with increasing number of atoms in the system (order- A methods), thereby al- 
lowing a study of more realistic systems. 

The problem is schematically illustrated in Fig. 1. In the framework of density 
functional theory (DFT) [6] our purpose is to solve the Kohn-Sham equations [7] 
for an infinite system of atoms. This set of effective one-electron equations is con- 
ventionally solved with a particular choice of basis functions. However, such an 
approach relies on the Bloch theorem, and thus requires ideal three-dimensional 
periodicity of the system at hand. If this is not the case, the periodicity is usually 
imposed artificially by considering only a finite part of the original system, the 



H. Dreysse (Ed.): Workshop 1998, LNP 535, pp. 379-398, 1999. 
© Springer-Verlag Berlin Heidelberg 1999 




380 



I. A. Abrikosov et al. 



so called siipcrccll, subject to periodic boundary conditions (Fig. la). This al- 
lows one to construct a Hamiltonian matrix which upon Fourier transformation 
to reciprocal k-space has a dimension proportional to the number of atoms N 
in the supercell. But due to the famous 0{N^) scaling problem (i.e. the com- 
putational time increases as with the number of atoms N in the supercell) 
the size of the cell is often limited by the computer power rather than by the 
physical problem itself. On the other hand, to account for short range order ef- 
fects in alloys [8,9] or to calculate interaction energies between point defects in a 
metal [10] one needs supercells with more than a hundred of atoms. For such big 
supcrcclls conventional approaches are not at all efficient, and scaling properties 
of the computational technique must be improved. 
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Fig. 1. Three ways to calculate electronic structure of infinite system composed of 
chemically nonequivalent atoms {filled and open circles) with an arbitrary degree of 
disorder, (a) In the framework of the supercell method one chooses some part of the 
original system (indicated by a full line), and repeats it periodically, (b) Conventional 
0{N) methods are based on the direct solution of the electronic structure problem for 
a finite part of the original system {dashed line), the local interaction zone, centered at 
all the sites of this system, (c) Within the LSGF method the boundary conditions for 
the multiple scattering problem inside a LIZ are substantially improved by embedding 
the LIZ into a self-consistent effective medium {gray eireles). As a result the size the 
LIZ is greatly reduced {dotted line) 



Within the last decade considerable attention has been devoted to the deve- 
lopment of so-called 0{N) methods. Most of them are based on the principle of 
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nearsightedness [11], illustrated in Fig. lb. Similar to the supereell method, one 
considers a finite subsystem of the original system, the so called local interaction 
zone (LIZ) [12,13]. But rather than introducing periodic boundary conditions, 
one solves the electronic structure problem for the particular LIZ, and then pro- 
ceeds to the next LIZ, thus dividing the iV-atom problem into N more or less 
independent problems for each LIZ. Such a procedure can be partly justified by 
the experience gained in the application of real-space cluster methods in elec- 
tronic structure calculations that shows that for a large cluster the properties of 
an atom deep inside the cluster are very close to those given by band structure 
methods. Also, it guarantees linear scaling of the computational efforts with N. 

Several 0{N) computational techniques differ from each other mainly by the 
methods of solving the electronic structure problem for the LIZ. For example, 
in Refs. [14-18] this is done by employing localized orbitals. Refs. [19-23] take 
advantages of the density matrix formulation, in Refs. [24,25] tigh-binding (TB) 
representation is used, while the techniques presented in Refs. [12,13,26,27] are 
based on a Green’s function approach. However, if there are M atoms in the 
LIZ, then the computational effort required for an exact solution of the elec- 
tronic structure problem inside a particular LIZ scales as . Thus the total 
computational time scales as M^N . One can see from Fig. 2 that there is a cer- 
tain minimum number of atoms N for which the 0{N) methods become more 
efficient than conventional 0{N^) methods. Thus, the problem with 0{N) me- 
thods is not only to achieve the linear scaling of the computational efforts with 
increasing number of atoms, but also to minimize the size of the LIZ. Unfortuna- 
tely, applications of the above mentioned linear scaling methods are very limited 
due to the fact that the size of their LIZ must in general be chosen quite large to 
give reliable description of the electronic properties, especially for metals [12,13]. 

The origin of this drawback can be understood as follows. From an illustra- 
tion in Fig. lb one can see that all the information about the system beyond the 
LIZ is totally neglected. Therefore, one needs to keep too many atoms directly 
in the LIZ in order not to loose essential physical information about the original 
system. Thus, there is a question if one can make the central atom of the LIZ 
more nearsighted. In Refs. [8,28] we have suggested that one can do this if one 
keeps at least some information about the system beyond the truncation region 
in the form of an effective medium (Fig. Ic). And, of course, the more informa- 
tion is kept, the better the convergence will be with respect to the size of the 
LIZ. Therefore, we have suggested to choose the effective medium that describes 
properties of the original system on average as accurate as possible. It is also 
clear that the symmetry of the effective medium may be much higher than the 
symmetry of the system under consideration. In Refs. [8,28] we have demon- 
strated how the computational effort of 0{N) calculations may be considerably 
reduced by embedding the LIZ in this effective medium (see Fig. Ic). Such an 
embedding may be established by means of the Dyson equation connecting the 
desired Green’s function to the Green’s function of the reference system. 

In the present paper we first review the main ideas of the locally self-consistent 
Green’s function (LSGF) technique proposed by Abrikosov et al. [8,28]. We then 
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Fig. 2. Schematic representation of computational efforts required to solving the el- 
ectronic structure problem by a conventional 0{N^) method {full line), as well as by 
0{N) methods with different number of atoms, M {dashed line) and 2M {long-dashed 
line), in the local interaction zone 



discuss some resent developments, in particular, multipole corrections to the 
atomic sphere approximation (ASA) [29,30] for the one-electron potential. We 
will next show how the tight-binding representation of the linear muffin-tin orbi- 
tal method [31-35] can be incorporated in our Green’s function technique. This 
will allow us to formulate general equations for systems without ideal three- 
dimensional symmetry of the underlying lattice, like, for instance, alloys with 
local lattice relaxations or surface alloys. Applications of the LSGF method to 
several problems of solid state physics will also be briefly discussed. 

2 Locally Self-Consistent Green’s Function Method 

2.1 Assumptions and Definitions 

We will formulate the locally self-consistent Greens function method in the fra- 
mework of the density functional theory and the local spin density approximation 
(LSDA) [6,7] in conjunction with a linear muffin-tin orbitals basis in the atomic 
sphere approximation of Andersen [31-35] including multipole correction terms 
[29,30], ASA+M. In this section we will consider the problem of calculating the 
electronic structure of a system of N atoms illustrated in Fig. 1. Though this 
is not necessary, we assume that the original system is represented by a super- 
cell subjected to periodic boundary conditions. This assumption allows us to 
calculate easily the long-ranged electrostatic contributions to the one-electron 
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potential and energy, and to construct appropriate LIZ for atoms close to the 
boundary of the supercell. We allow for an arbitrary distribution of the atoms 
that may be of different types on the sites of the underlying crystal lattice. Ho- 
wever, in this section we do not allow these lattice sites to deviate from their 
ideal positions, i.e. we neglect the so-called local relaxation effects. This problem 
will be considered in Sec. 3.2. 

We will assign to each site of the supercell a corresponding atomic sphere (AS) 
centered at radius-vector R. Inside each AS we will define the electron density 
PR,(r) and one-electron potential UR,(r). Also the potential function P^i{z) 

[z - Cr/)(7r; — aij + Ari 

for an arbitrary (complex) energy x and angular-momentum quantum number 
I may be assigned to each site. It is expressed by means of the band center C, 
bandwidth A, and the 7 LMTO potential parameters calculated at an arbitrary 
energy £i/r; in the energy range of interest [33]. Index a denotes the LMTO 
representation, and in the following discussion it will be used only when relevant. 
The so-called KKR-ASA Green’s function matrix g is defined like 



[P(^) - S(k)]g(k,2;) = 1, (2) 

where P , S and g are (RL, R' L') matrices, L is the combined angular-momentum 
quantum numbers and the structure constant matrix S contains all the 

information about the crystal structure. The real space KKR-ASA Green’s fun- 
ction matrix g-R.L,ivL'{z) is obtained from g(k) by integration over the Brillouin 
zone 



g-R.L,-R'L'{z) = {Vbz) ^ ^3UL,U'L'(k, x), (3) 

JBZ 

where Vbz is the volume of the Brillouin zone, U is a basis vector of the unit 
cell. It is connected to the lattice site R by a translation T, i.e., R = U-l-T. The 
on-site element prl,rl'( 2 ) is a key quantity which determines both the electron 
density and the density of states at the R-th site. Therefore, our purpose is to 
calculate these matrix elements for all the sites of the supcrccll. In a conventional 
Green’s function technique this requires an inversion of Eq. (2) at each k-point in 
the Brillouin zone and for each energy point z. This is essentially the operation 
that results in 0{N^) scaling of the problem. In the following we will show how 
this problem can be solved without a direct inversion of Eq. (2) in the framework 
of the LSGF method. 

2.2 Concept of Local Self-Consistency and Effective Medium 

The concept of local self-consistency is based on the suggestion that the electron 
density and the density of states on a particular atom within a large condensed 
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system may be obtained with sufficient accuracy by considering only the electro- 
nic multiple scattering processes in a finite spatial region centered at that atom. 
This concept has been applied by by Nicholson et al. [12,13] in the framework of 
the locally self-consistent multiple scattering (LSMS) method. Within this me- 
thod each atom of a system is surrounded by few shells of nearest neighbors (see 
Fig. lb), the local interaction zone, and the multiple scattering problem is sol- 
ved in the real space for each LIZ independently of the other LIZs. The method 
was proven to have essentially 0{N) scaling properties with increasing supercell 
size. Nevertheless, it turns out that the minimal size of the LIZ should be about 
100 atoms and even more to provide the accuracy of order 0.1 mRy / atom for 
the total energy. It is quite clear that the reliable description of the central site 
scattering properties will be achieved only when the size of the LIZ is so large, 
that an atom at this site becomes completely insensitive to what is happening 
beyond the LIZ. In particular, if one adds an extra atom at the LIZ’s boundary, 
this should have no effect on the central site. 

The concept of local self-consistency has been further developed by Abrikosov 
et al. [8,28] in the locally self-consistent Green’s function method (LSGF), where 
the concept of an effective medium has been combined together with the LIZ 
concept. This resulted in the development of an efficient 0{N) technique. The 
combination of these two concepts can be most easily illustrated by the example 
of a substitutional alloy with an arbitrary degree of short and long range order. 

In choosing the effective medium we tried to satisfy the following main cri- 
teria, namely its scattering properties as viewed by the central atom of the LIZ 
must be as close as possible to those of the original system at the shortest pos- 
sible distance. Additionally, it has to be as simple (i.e. symmetric) as possible. 
In the case of the above mentioned substitutional alloy it is clear that a very 
good representation of the real system beyond the LIZ would be a completely 
random alloy. In particular, one can expect that the convergence with respect 
to the LIZ size will be achieved as soon as the central site becomes insensitive 
to the interchange of positions of two atoms at the LIZ’s boundary, in contrast 
to a requirement of the complete insensitivity in conventional 0{N) schemes. 
Also, the random alloy has on the average the highest possible symmetry, and 
this will allow us to perform at least partly calculations in the reciprocal space 
of the effective lattice which in its own turn has a minimal number of atoms in 
the unit cell. 

Now, there is a question of how to represent mathematically the effective 
medium in the form of a completely random alloy. Based on the experience gai- 
ned in the study of alloys Abrikosov et al. [8,28] have suggested to do this in 
the framework of the multicomponent generalization of the coherent-potential 
approximation (CPA). In fact, the CPA effective medium fulhlls both the above 
mentioned criteria. Despite the fact that the CPA is a very simple single-site 
approximation, it has been shown to give density of states for random alloys 
in very good agreement with experiment as well as with more accurate calcu- 
lations that go beyond the single-site approximation [36-38]. In addition, the 
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CPA Green’s function decays, apart from an oscillating factor, exponentially as 
where I is the mean free path [2]. 

However, conventionally the CPA describes scattering property of a binary 
alloy in terms of two potentials, for A and B alloy component. In reality situation 
is different, and in a N atom system, even though it is composed of just two 
chemically inequivalent atoms, A and B, all the local potentials are in general 
inequivalent due to the differences in local environments of any site. Therefore, 
one must define the effective medium to be used in the embedding of the LIZ as 
that given by the CPA for a multicomponent alloy. The number of components at 
each sublattice will be equal to the number of equivalent positions in the supcrccll 
formed from the underlying lattice, i.e., in the simplest case of a monoatomic 
underlying lattice each atom in the supercell is considered to be a component of 
an A-atom alloy. We thereby assume that the atoms are randomly distributed on 
their sublattices and neglect the fact that they occupy definite positions in the 
system. The difference between different atoms (or alloy components) will enter 
through their one-electron potentials. This method has recently been elevated 
to an idea of polymorphous CPA [38] . 

2.3 Computational Algorithm 

In this section we will describe in details the complete self-consistent procedure 
of the LSGF method within the LMTO basis set as applied to the problem of 
calculating the electronic properties of a paramagnetic system of N atoms in a 
supercell subjected to periodic boundary conditions, such as illustrated in Fig. 1. 
Generalization to systems with collinear magnetic moments is straightforward, 
but there also exists a generalization of the method for systems with arbitrary 
orientations of local magnetic moments [39] . 

The principal scheme is similar to any other Green’s function technique 
[34,40-43], but there are certain new steps that are specific for the LSGF me- 
thod. We start with a guess for the charge density of all the atoms in the system, 
calculated as a renormalized atomic density or by means of a conventional CPA 
calculation for a random alloy with the same composition as in the supercell. 
The first problem is to construct an effective medium (all parameters that cha- 
racterize the latter are denoted by a tilde). This means we must determine the 
potential function for the effective scatterers P and the effective Green’s function 
g. Here we remark, that in some cases it turns out to be more efficient to chose 
the effective medium in the form of multisublattice alloy. In particular, it has 
been demonstrated in Ref. [28] that for a partially ordered Ni(Ni 9 . 375 Al 9 o, 625 ) 
alloy the convergence with respect to the size of the LIZ is substantially enhan- 
ced by substituting the most symmetric bcc effective medium for an effective 
medium on B2 underlying lattice with two nonequivalcnt types of the effective 
atoms, one for each nonequivalent sublattice. Thus, one must solve the following 
system of coupled single-site equations for the U sites in the unit cell of the 
underlying lattice [43,44] 
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5tju = (Yez)-^ f dk ([P - S(k)]-i)uu 

JBZ 

5r = 3uu + ffuu(-Pu ~ -Pr)5r (4) 

5uu = (-^u) ^ ^3 r> 

R 

where U denotes a particular sublattice of the underlying lattice, R £ U, i.e., 
R = U + T, Vbz is the volume of the Brillouin zone of the underlying unit cell 
and the integration is performed over the corresponding Brillouin zone, S is the 
structure constant matrix of the underlying lattice. In Eq. (4) N-^j is a number 
of atoms at the corresponding sublattice of the effective lattice, i.e. = N for 
a Bravais lattice, while, for instance, for the B2 lattice N-^ = N/2. Eq. (4) must 
be solved for each sublattice of the effective lattice. In practice it is solved by 
iterations with an initial guess for the effective potential function in the following 
form: 



^ (5) 

R6U 

using an efficient procedure described in details in Ref. [28]. 

As has been specified in Sec. 2.1, the key quantity to be calculated in order 
to get access to all the electronic properties of a system is on-site elements of the 
Green’s function matrix, 5 rl,rl'(-z)- In the framework of the LSGF method we 
calculate them using the concept of the local self-consistency, i.e. for each site 
of the supercell separately. This site is considered as a central site of a corre- 
sponding LIZ embedded in the effective medium constructed by the procedure 
described above (Fig Ic). The Green’s function for the LIZ can be found by 
solving corresponding Dyson equation 

M 

.9rr = .9rr + ^ .9rr'(^R' ^ 4^r').9r'R) (6) 

R' = l 

where the sum runs over the M atoms in the LIZ around site R. Eq. (6) has to 
be solved for all the sites in the original system. Thus, the problem of solving the 
Kohn-Sham equations for an A^-atoni system is decomposed into N linked locally 
self-consistent problems for the LIZ associated with each atom in the system. 
Note also that in Eq.(6) 9 rr' is a matrix element of the complete (diagonal, 
as well as off-diagonal) Green’s function matrix of the effective medium, and 
it is calculated from the corresponding Brilluoin zone integral, Eq. (3), where 
U now represents a basis vector of the underlying lattice U. At each iteration 
and for any energy point this Green’s function must be calcidated only once for 
the entire system, and shall not be updated when moving the LIZ from one site 
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to another due to the invariance of all the sites of the underlying lattice that 
belong to the same sublattice U with respect to a translation T. Moreover, due 
to the high symmetry of the effective medium calculating integral (3) is not a 
time-consuming operation. 

We remark that in general ,9 rr' calculated by the Dyson equation (6) will not 
correspond to the one of a real system, especially close to the LIZ boundary. But 
the on-site element for the central atom will approach that of the real atom at R 
for a sufficiently large LIZ. In this sense (?rr will be locally self-consistent. This 
also distinguishes a solution of Eq. (6) from a solution of a single-site CPA Eq. 
(4), thus justifying our use of different notations for the corresponding on-site 
elements of the Green’s function in these equations. 

After the on-site elements of the Green’s function have been calculated for 
the entire system, i.e. Eq. (6) has been solved for all the sites R, the remaining 
procedure is completely similar to any conventional Green’s function technique 
[34,40-43]. We proceed by transforming the KKR-ASA Green’s function g into 
the Hamiltonian Green’s function G: 



^RL.RL'(^) “ 
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where the LMTO-representation dependent potential parameters V-^, 
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Then the moments of the state density can be calculated 



(8) 



1 



t-rI'L" = 7F- f G'^L',RL"iz){z - 



r ' ~’RL',RL' 

and using the second-order Taylor expansion of the partial wave, 0Rg(e;r): 



(9) 



(pRi{£;r) ^ (j)uRi{r) + {e - £uRi) ^uRi{r) + - euRi)"^ ii^Ri{r), (10) 

the valence charge density n^(r) in the corresponding atomic spheres can be 
found as the one-centre expansion 



nR{r) = (47t) ^ ^{[(l)uRi{r)frn^LL + 

L 

+ [^uRi{r)^uRi{r) + (j)uRi{r)ij)yRi{r)]mRi^L}. 



( 11 ) 
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In Eq. (9) the contour must enclose the occuped valence states, while 
and in Eqs. (10) and (11) denote the partial wave and its first and second 
energy derivatives, respectively, evaluated at the energy An iteration is 

completed by solving Poissons equation for the electrostatic potential and adding 
the exchange-correlation potential. In addition, in contrast to the conventional 
treatment of an alloy problem in the framework of the CPA, adding corrections 
to the ASA becomes meaningful within the LSGF scheme. This will be described 
in the next section. The entire procedure is then repeated until self-consistency. 

2.4 Multipole Corrections to the ASA 

A supercell approach allows one to calculate the electrostatic contributions to 
the onc-clcctron potential and energy exactly by performing the Madclung sum- 
mation, because the atomic sphere charges in that case are calculated explicitly 
[12,13,8]. However, in the ASA, the electron charge is usually assumed to have 
spherical symmetry inside each atomic sphere. This approximation works quite 
well if all the atomic positions have high-symmetry local coordinations. When 
the local symmetry is broken, this approximation becomes less appropriate, and 
calculations based on the atomic sphere approximation sometimes lead to sub- 
stantial errors when calculating physical properties of solids. For example, the 
vacancy formation energy is usually overestimated by as much as a factor of 2 
[45-47], and the work function is about 50 % higher than experimental values 
[29,4]. 

The problem of Madelung contributions naturally emerges if one divides the 
crystal space into atomic or muffin-tin spheres and tries to describe the elec- 
trostatic interaction between the interior of a sphere and the rest of the crystal 
by adding a certain, constant throughout the sphere, electrostatic shift to the 
one-electron potential. However, if one considers, for example, an isolated va- 
cancy in a metal matrix, then one sees that vacancy itself has a very symmetrical 
local coordination in the case of a cubic or hexagonal crystal structure of the 
metal matrix. But the local coordination of its close neighbors is very unsym- 
mctrical because one atom has been removed to form the vacancy. On each of 
the vacancy neighbors a dipole moment of the electron charge should appear as 
a result of the fact that atomic density tails penetrate into this atomic sphere 
from all of its neighbors except the vacant site. An additional monopole poten- 
tial shift on the vacant site must be thus induced by the dipole moments of the 
surrounding atoms. It is clear, that this contribTition is omitted if the spherical 
approximation for the electron density is used. 

This simple example illustrates the importance of taking into account mul- 
tipole Madelung contributions to the potential and energy when dealing with a 
system in which the symmetry of the local coiihgurations is low. Otherwise, the 
system can be adequately described by only the monopole Madelung terms. It is, 
of course, also important to examine the relative contributions of the multipoles 
higher than the dipole, but it is clear that the dipole term is dominant in the 
case of a vacancy as well as for surface problems. 
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Probably the most consistent way of dealing with this problem would be 
in the framework of a full-potential approach. However, as has been shown in 
several works [29,4,48,49,30], one can expect to be able to determine an accurate 
total energy of a system by still using a spherical potential provided the spherical 
approximation to the electron charge density is lifted. The inonopole (L = 0) 
Madelung contribution to the one-electron potential is then evaluated using the 
monopole as well as multipole components of the valence electron charge, Q^, 
as: 



R',L' 

where is the multipole Madelung matrix which is equivalent to the con- 

ventional (unscreened) LMTO structure constants for the entire supercell, and 
S is the average Wigner-Seitz radius in all space. 

On the other hand, the total energy is calculated including all possible el- 
ectrostatic interactions between the multipole charges (monopole - monopole, 
monopole - dipole, dipole - dipole, etc.): 

= ^E E (13) 

R,L R',L' 

In Eqs. (12) and (13) multipole moments of the charge density are calculated 
as integrals over the atomic sphere with the origin taken at R: 
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ZrSi^o- (14) 



Here Sr, riR(r) and Zr are the atomic sphere radius, valence (nonspherical) 
electron density and valence number, respectively, of the atom at the site R, 
YL(f) is a real harmonic, and Cl,l'.L" f^re real-harmonic Gaunt coefficients. 
Radial parts of the valence density, /rl'L"(^), are obtained using the second- 
order Taylor expansion of the partial wave (10) as 



IrL'L" = <('(/R;'</>!/R;"Wrl'L" + 4’t^Rl'<PiyRl"'m'RL'L" 

+ ^i/R;"/'iyR;"RiRL'L" + ')'!^Ri'0i^R/"^RL'L" (15) 

+ -<Pi/Rl'(puRl" ^RL'L" + ~^uRl'4>uRl" '^RL' L" ■ 

In the last expression, the radial dependence of /rl'L"(^) and the expansion 
coefficients 4>vRi{‘>’) has been omitted. In Eq. (15) the density of states moments. 
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given by Eq. (9). In this regard we remark that if the angular mo- 
mentum cutoff for the Green’s function is equal to Imax, then multipole moments 
of the charge density, Eq. (14), have nonzero components up to 2lmax due to the 
properties of Gaunt coefficients. 



3 Taking Advantages of Tight-Binding LMTO 
Representation 

All equations presented in Sec. 2 are formulated for a general LMTO represen- 
tation a. Indeed, in the case of a bulk alloy without local lattice relaxations 
LSGF calculations can be carried out in any representation with the same com- 
putational efforts. However, when in addition to a substitutional disorder the 
ideal three-dimensional symmetry of an underlying lattice is also broken, a for- 
mulation of the method in terms of tight-binding (TB) LMTO basis set can give 
one a substantial advantage. Below we will consider two such cases, that is we 
will present generalization of the LSGF method to the case of (i) a surface alloy 
and (ii) an alloy with local distortions of atomic positions from the sites of ideal 
underlying lattice (local lattice relaxations). 

3.1 Electronic Properties of Surface Alloys: the LSGF Method 

The large interest in theoretical investigations of surfaces of transition metals 
and their alloys is motivated by its fundamental scientific value, as well as by 
the great practical importance of these systems. The properties of solid surfaces 
and interfaces between two metals play an important role in such phenomena 
as catalysis, chemisorbtion, adhesion and corrosion, just to mention a few ex- 
amples. Recently there has been significant progress in ab initio calculations 
of the electronic structure of surfaces of ordered materials using first-principles 
techniques. It appears that one may calculate surface related properties such as 
surface tension and work function with a high degree of accuracy [29,4]. In this 
context, random metallic alloys represent a large class of important materials. 
As to now the surface properties of these systems have only been investigated 
in a few cases. 

Let us consider for simplicity a system of N atoms with an ideal surface, i.e. 
without any imperfections, like, for example, relaxation, reconstruction, steps, 
etc. (generalization of the method for the latest case is trivial). The system is 
schematically illustrated in Fig. 3. In general, there are at least two ways of 
solving the electronic structure problem for this system in the framework of the 
LSGF method. Firstly, one can use a traditional approach (Fig. 3a), and model 
this surface by a supercell or a slab, thus coming back to a three-dimensional 
problem that is to be solved by a conventional LSGF method (see Fig. 3b) 
described in Sec. 2. Such a computational scheme will, of course, scale linearly 
with increasing number of atoms in the supercell, and in principle, it has been 
already successfully applied for several systems [28,50]. However, in doing so one 
has to construct an effective medium that represents on the average scattering 
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properties of the alloy components, as well as takes into account a vacuum region 
modeled in the framework of LMTO-ASA method by empty spheres. Giving the 
fact that these properties differ considerably from each other, one will expect a 
substantial increase of the LIZ size. 

At the same time, as has been demonstrated in Ref. [28], one of the advanta- 
ges of the LSGF method is the possibility to vary the effective medium to suit a 
problem at hand. Thus, in a case of surface alloy the better choice of the latter 
will be the one with a layer dependent Green’s function (Fig. 3c). This can be 
achieved in the framework of interface Green’s function technique, proposed by 
Lambrccht and Andersen [51] and developed by Skriver and Rosengaard [29]. 
An implementation of this technique in conjunction with the GPA method has 
been described by Abrikosov and Skriver [43]. 

Similar to these techniques, a solution to the problem of calculating the 
electronic structure of a surface or an interface is separated into two parts. 
First, one must calculate the Green’s function for the ideal bulk crystals on the 
both sides of the interface. In the case of a random alloy this has to be done 
by means of the LSGF method (Sec. 2) for a supercell which models the bulk 
alloy, i.e. without any presence of the surface or the interface. At the second 
stage one can construct an appropriate supercell to model a region of the alloy 
in the neighborhood of the interface. Note, that due to a local character of the 
perturbation induced by the surface or the interface, the size of this supercell 
along the direction perpendicular to the interface is not supposed to be too large. 
Proper boundary conditions will be ensured by using the bulk potential function 
from Eq. (4) for the effective medium outside the surface region obtained at the 
first stage. On the contrary, within the scheme illustrated in Fig. 3a and b, the 
whole problem must be solved for the same supercell which in this case should 
probably be quite large. 

Following the main idea behind the LSGF method, we construct an effective 
medium for each layer A in the neighborhood of a surface as in the case of a 
multicomponent two-dimensional alloy: 

/ dk||4^(k,|,0) 

-/ o HZ 

= + ( 16 ) 

R 

where Asbz is the area of the 2D surface Brilouin zone (SBZ), Nq and the 
number of atoms and the potential function of real atoms, respectively, at the 
sites 1? = (1?||, f2±) that belong to the same sublatticc of the 2D unit coll of the 
underlying surface lattice in the A layer (note that there can be more than one 
site 1? in the layer A), the potential function of an effective scatterer at i?, 
R e 1?, and (3 denotes the most localized, tight-binding LMTO-representation 
[32,33] which is essential to use in the calculation of the surface Green’s function 
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Fig. 3. Two ways of modeling surface alloy in the framework of the LSGF method, 
(a) One may employ a slab or a supercell approach, and repeat a part of the sample, 
including empty spheres {thin dotted circles) in all three directions, (b) An effective 
medium {gray circles) shall represent on the average scattering properties of real alloy 
components {filled and open circles), as well as empty spheres, and the embedding 
procedure can be carried out in a conventional manner, (c) By means of the interface 
Green’s function technique in conjunction with the LSGF method the surface alloy is 
treated in its true semi-infinite geometry, and a layer-dependent effective medium is 
introduced, (d) Embedding procedure can be carried out, and the size of the LIZ is 
supposed to be smaller than in the case of the slab or the conventional supercell 
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g|?^(k||,z). As usual in the case of surfaces, a good quantum number is k|| G 
SBZ, and the dependence of the surface Green’s function on ky is integrated off 
by means of the principle layer technique [52]. In doing so one (i) calculates the 
ideal Green’s functions for both sides of the interface using the unperturbed bulk 
potential functions of effective scatterers obtained for the supercell which models 
the bulk alloy, and (ii) solves a set of recurrent equations that glue the two sides 
of the interface together and take care of the perturbation of local potentials in 
the interface region. Note that this operation scales as 0{Na), where Na is a 
number of layers in the interface region. 

When the layer dependent effective medium is constructed by Eq. (16), i.e. 
is calculated for any sublattice 1? and for all layers A in the supercell that 
models surface alloy, the embedding procedure has to be carried out in a similar 
manner as for the bulk alloy. If we now denote sites of the LIZ as R = (R|| , R±), 
then the off-diagonal effective medium Green’s function for the LIZ is calculated 
as an integral over the SBZ 



= {Asbz)-^ [ (17) 

«/ SBZ 

and the Green’s function for the central site of the LIZ embedded into the effec- 
tive medium (Fig. 3d) is obtained by solving Dyson equation (6). The remaining 
procedure is analogous to the one presented in Sec. 2. 

3.2 Electronic Properties of Alloys with Local Lattice Relaxations 

In general, alloys are composed of elements which are not only chemically nonequ- 
ivalent, but also have different atomic sizes. As a result this size mismatch causes 
interatomic distancies in the alloy to be different, i.e. it causes so-called local 
relaxation effects. The system can also be looked upon as one where local posi- 
tions of all the atoms are moved from the sites of an ideal periodic underlying 
lattice (Fig. 4a). Though the recent systematic study of lattice relaxations aro- 
und a single impurity in Cu by Papanikolaou et al. [53] has shown that their 
contribution to the impurity solution energy in general is small compared to the 
values calculated earlier without lattice relaxations [3], this effect can influence 
the results for the density of states and the total energy calculations in some 
cases of very large size mismatches [54] , and it is important to be able to take 
the local relaxations into consideration. 

Within the LSGF method in conjunction with the TB-LMTO basis set the 
following procedure can be suggested. Firstly, we construct an effective medium 
for the undistorted underlying lattice in essentially the same way as has been 
described in Sec. 2. Then we embed a LIZ that is composed of an atom at the 
particular site and several shells of its nearest neighbors and that also includes 
all the relaxations in this effective medium (Fig. 4b). Thus, the Dyson equation 
that has to be solved now shall include both, the perturbation due to chemical 
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Fig. 4. Schematic representation of the LSGF method applied for an alloy with local 
lattice relaxations, (a) An original system composed of two chemically nonequivalent 
components (filled and open circles) can be looked upon as the system where local 
positions of all the atoms are moved from the sites of an ideal periodic underlying 
lattice (indicated by thin lines), (b) Effective atoms (gray eircles) are placed at the sites 
of the undistorted underlying lattice, and the Dyson equation (18) for the LIZ (dotted 
line) embedded into this effective medium accounts for two kinds of perturbations, due 
to the potential part, as well as due to structural perturbations 



disorder, as well as due to the fact that the atomic positions are shifted from 
the sites of the underlying lattice. In the framework of the TB-LMTO method 
this perturbation can be included in Eq. (6) through the difference between 
the real space structure constant matrices for the original system S and for the 
underlying lattice S as following 



g(^) = [(g(^))-i + (P(^) - P(z)) - (S - S)]-\ 



where bold symbols denotes (RL,R'L') matrices, and R, R' e LIZ. 



(18) 
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Note, that the only clement needed from a solution of Eq. (18) is again the on- 
site element gRL,RL'{z) for the central site of the LIZ. Therefore, the advantages 
of the TB-LMTO representation shall show up in full strength here. Not only it 
allows one to calculate easily structure constant matrices for the effective and 
the original lattices in the real space, bnt also dne to their localized character 
the difference between these lattices as it is seen from the central site of the 
LIZ vanishes exactly at very short distancies. As has been specified in Sec. 2 the 
above criterion is one of the major requirements to the effective medium that 
ensures the minimal size of the LIZ. Therefore, though the LIZ size necessary 
to carry out calculations for systems with local lattice relaxations will increase 
compared to the unrelaxed case, this increase is not supposed to be too large. 

4 Summary 

The order- locally self-consistent Green’s function method has been formula- 
ted in the framework of the LMTO method in the tight-binding representation. 
The atomic sphere approximation has been used, but we also have shown how 
multipole corrections to the ASA can be easily incorporated in our formalism. 
The LSGF method employs two basic concepts, the local interaction zone con- 
cept, that is the multiple scattering problem is solved in real space for each atom 
surrounded by few shells of nearest neighbors, and the effective medium concept 
that provides appropriate boundary conditions for the multiple scattering pro- 
blem by embedding the LIZ into a self-consistent effective medium constructed 
in such a way as to describe as close as possible scattering properties of the origi- 
nal system beyond the LIZ boundary. The latter concept ensures a minimal size 
of the LIZ, and therefore enhances the efficiency of the LSGF method as com- 
pared to other 0(N) methods. The tight-binding representation of the LMTO 
method allows us to present generalization of the method to two important cases, 
a surface alloy, and an alloy with local lattice relaxations. 

Applications of the LSGF method to several problems of alloy theory have 
shown that it is an excellent tool for studing systems with an arbitrary distribu- 
tion of atoms on the sites of the underlying crystal lattice. The method allows 
one to include directly short range order effects when calculating the electronic 
structure and the total energy of random alloys. In particular, mixing energies 
of fee Gu-Zn alloys calculated by the LSGF method for alloys with appropriate 
amount of short-range order were found to be in much better agreement with 
experiment than those where short-range order was neglected [8]. Moreover, the 
LSGF method takes into account all the local environment effects. Thus, it provi- 
des one with an atomic scale resolution when analyzing the electronic properties 
of materials, for example, spectral properties [37] or local magnetic moments 
[55]. 

The LSGF method gives one the opportunity to calculate the so-called ef- 
fective cluster interactions which can be later applied in the framework of a 
statistical mechanical technique in order to study phase stabilities of alloys. In 
particular, one can extract concentration dependent effective cluster interactions. 
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This may be done by calculating the total energies of alloys with different sets of 
correlation functions but for some fixed concentrations of the alloy components 
and then by mapping these energies onto the corresponding cluster expansion. 
Since these calculations are fast one may perform a large number of them and 
thereby increase the accuracy of the interaction parameters obtained. The re- 
sulting effective cluster interactions will include all contributions to the total 
energy. Such calculations have been carried out with a great success by Simak 
et al. [9] for a multicomponent Cu 2 NiZn alloy. 

By means of the LSGF method one can also study properties of point defects 
and their clusters. In particular, in order to study a realistic metal-vacancy sy- 
stem, it is necessary to consider a large supercell in which the vacancies are well 
separated. To calculate vacancy-vacancy or vacancy-solute interaction energies 
in alloys or to study complex thermal defects in some intermetallics, very large 
supercells are necessary, so conventional methods of band structure calculations 
become extremely inefficient. Here our 0(N) technique has allowed us, for ex- 
ample, to perform a systematic study of the vacancy formation energy of the 3d, 
4d, and 5d transition and noble metals, and to discuss its variation through a 
transition metal series, as well as the effects of crystal and magnetic structure 
[30], and the interaction of vacancies with other defects [10]. 

In conclusion, we find in number of applications and numerical tests that the 
LSGF method in conjunction with the TB-LMTO basis set leads to a reliable 
description of electronic properties of alloys. In general it yields results in ex- 
cellent agreement with those obtained by alternative hrst-principles techniques, 
but becomes more efficient than the latter already for systems that contain 30 
to 100 atoms in the unit cell. Thus, the LSGF method is a powerful technique 
for solving different problems of materials science theory. 
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Sparse Direct Methods: An Introduction 



J. A. Scott 

Department for Computation and Information, Rutherford Appleton Laboratory, 
Chilton, Didcot, Oxon 0X11 ORA, England. (J.Scott@rl.ac.uk) 



Abstract. The solution of large-scale linear systems lies at the heart of many compu- 
tations in science, engineering, industry, and (more recently) finance. In this paper, we 
give a brief introduction to direct methods based on Gaussian elimination for the solu- 
tion of such systems. We discuss the methods with reference to the sparse direct solvers 
that are available in the Harwell Subroutine Library. We briefly consider large sparse 
eigenvalue problems and show how the efficient solution of such problems depends upon 
the efiicient solution of sparse linear systems. 



1 Introduction 

Sparse matrices arise in very many application areas, including such diverse 
fields as structural analysis, chemical engineering, surveying, and economics. A 
matrix is sparse if many of its coefficients are zero and there is an advantage 
in exploiting the zeros. In this paper, we present a brief introduction to direct 
methods for the solution of large sparse linear systems of equations 

Ax = h. (1) 

We arc concerned with methods that are based on Gaussian elimination. That 
is, we compute an LU factorization of a permutation of A 

PAQ = LU, 

where P and Q are permutation matrices and L and U are lower and upper 
triangular matrices, respectively. These factors are used to solve the system (1) 
through the forward elimination 



Ly = Pb 



followed by the back substitution 



Uz = y. 

The required solution x is then the permuted vector x = Qz. When A is sym- 
metric positive definite, it is normal to use the Cholesky factorization 

PAP'^ = LL'^. 

H. Dreysse (Ed.): Workshop 1998, LNP 535, pp. 401-415, 1999. 
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For more general symmetric matrices, the factorization 

PAP'^ = LDL^, 

is more appropriate. For a stable decomposition in the indefinite case, the matrix 
D is block diagonal with blocks of order 1 or 2, and L is unit lower triangular. 

Several different approaches to Gaussian elimination for sparse matrices have 
been developed. They can each be divided into a number of phases: 

1. Preordering to exploit structure eg preordering to block triangular form 





/Hn \ 




B 21 B 22 


PAQ = 


B31 B 32 B33 




\Bni Bn2 ... Bnn j 



so that only the diagonal blocks Bu need to be factorized. 

2. Analyse - the sparsity pattern of A is analyzed to produce a suitable ordering 
and data structures for efficient factorization. 

3. Factorize - the numerical factorization of A is performed. 

4. Solve - the factors are used to solve one or more systems (1) using forward 
elimination and back substitution. 

Some codes combine the analyse and factorize phases so that numerical values 
are available when the ordering is being generated. Phase 3 (or the combined 
phase 2 and 3) generally requires most computational time. If more than one 
matrix with the same sparsity pattern is to be factorized, the analyse phase only 
needs to be performed once. Thus in contrast to dense solvers, for sparse solvers, 
it is potentially much faster to perform subsequent factorizations. 

In the following sections, we introduce three approaches: the general ap- 
proach, frontal methods, and multifrontal methods. We illustrate these different 
approaches using software from the Harwell Subroutine Library (HSL) [I] and 
use numerical examples from a range of scientihe and industrial applications. A 
useful reference is the book by Duff, Erisman, and Reid [2] and for an extensive 
list of references to recent developments in the area, we recommend the review 
by Duff [3]. 

We end this section by noting that the order of a matrix that is considered 
large is a function of time depending on both the development of dense and sparse 
codes and advances in computer architecture. However, by today’s standards, 
A need not be very large for it to be worthwhile to exploit sparsity. This is 
illustrated in Table 1 (taken from Duff [4] ) , in which we compare the performance 
of the HSL sparse solver MA48 with that of the dense solver SGESV from LAPACK. 
The problems are all from the Harwell-Boeing Sparse Matrix Collection [5] . The 
experiments were performed on a single processor of a CRAY Y-MP vector 
supercomputer and the timings are given in seconds. 




Sparse Direct Methods 403 



Table 1. A comparison of timings on the CRAY Y-MP for MA48 and SGESV on Harwell- 
Boeing matrices. 



Identifier 


Order 


Number of 
entries 


MA48 


SGESV 


FS 680 3 


680 


2646 


0.06 


0.96 


PORES 2 


1224 


9613 


0.54 


4.54 


BCSSTK27 


1224 


56126 


2.07 


4.55 


NNC1374 


1374 


8606 


0.70 


6.19 


WEST2021 


2021 


7353 


0.21 


18.88 


ORANI678 


2529 


90158 


1.17 


36.37 



2 The General Approach 

The principal features of the general approach are: 

• sparse data structures are used throughout 

• numerical and sparsity pivoting are performed at the same time (phases 2 
and 3 are combined). 

The efficient implementation of techniques for handling sparse data structures 
is of crucial importance. The most common sparse data structure and the one 
used in most general-purpose codes holds the matrix by rows. All rows arc stored 
in the same way with the real values and column indices in two arrays with a 
one-to-one correspondence between the arrays, so that the real value in position 
k. say, is in the column indicated by the entry in position k of the column index 
array. A sparse matrix can then be stored as a collection of sparse rows in two 
arrays; one integer, the other real, both of length nz, where nz is the number of 
entries in A. A third "pointer” array of length n+1 (n is the order of A) is used 
to identify the position in the first two arrays of the data structure for each row. 
To illustrate this scheme, consider the following 4x4 sparse matrix: 

/ 6 0 1 0 \ 

2 4 0 -1 
0100 
\ 7 0 0 -3/ 

Storing this matrix as a collection of sparse row vectors we have 

column index 13241214 
value 6. 1. 4. -1. 2. 1. 7. -3. 

row pointer 1 3 6 7 9 



Note that the column indices within each row need not be held in order. The 
advantages of the scheme are that it is a simple and compact method of storing 
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the matrix and it is straightforward to access the matrix by rows. A disadvantage 
is that it is difficult to insert entries, which is needed in Gaussian elimination 
when a multiple of one row (the pivot row) of the matrix is added to the other 
(non-pivot) rows with different sparsity patterns. 

In general, the matrix factors L and U are denser than A. For efficiency, in 
terms of both storage and floating-point operations (flops), it is essential to try 
and restrict the amount of “fill-in” (that is, the number of entries in L and U 
that correspond to zeros in A). The rows and columns of A need to be ordered 
to preserve sparsity. For example, consider the symmetric matrix with sparsity 
pattern 



A = 



Choosing the pivots in order down 
is LL^, where L has the form 



L = 



/ X X X X x\ 

X X 
X X 
X X 
\x X 

the diagonal, the Cholesky factorization of A 
lx ^ 

X X 
XXX 
X X X X 
\x X X X X J 



Thus all sparsity has been lost. However, reordering the rows of A in reverse 
order 



PA = A 



/ X x\ 

X X 
X X 
X X 

\X X X X X J 



and L retains the sparsity of A 



L = 



( X \ 

X 

X 

X 

\x X X X X j 



A simple but effective strategy for maintaining sparsity is due to Marko- 
witz [6]. We motivate this strategy by considering the first step of Gaussian 
elimination for the matrix A partitioned in the form 



A = 



a aA \ 

) ■ 
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Assuming a is a suitable pivot choice, the first step in the matrix factorization 
is 



A = 



Ad 



a 

0 / 



where 

A(i)=A^°)-^. (2) 

Fill-in occurs when an entry in the rank-one matrix aettj is a zero in A® . 
Clearly the dominant cost in the update (2) is that of forming the outer product 
. This cost is proportional to the product of the number of nonzeros in 
Oc and the number of nonzeros in . At each stage of Gaussian elimination, 
Markowitz therefore selects as the pivot the nonzero entry of the remaining 

(k) 

reduced submatrix that minimizes this product. That is, a\j is chosen as the 
pivot for the fc-th stage to minimize 

(rf) - l)(cf) ~ 1), 

where and denote, respectively, the number of nonzeros in row i and 
column j of A^) = 

For stability, pivot candidates must also satisfy some numerical criteria. In 
particular, assuming row and column interchanges have been performed to bring 
the pivot candidate selected by the Markowitz criteria to the diagonal, the pivot 
is only acceptable if it satisfies the inequality 

l4fc I > “ 

where u is a preset threshold parameter in the range 0 < u < 1. The choice 
u = 1 corresponds to partial pivoting but, in general, this value is too restrictive 
and leads to a large number of pivots being rejected and to unnecessary fill-in. 
If u is chosen to be too small, instability can result. A common choice is u = 
0.1. Experience has shown that this value usually provides a good compromise 
between maintaining stability and preserving sparsity. 

In the Harwell Subroutine Library, the package MA48 of Duff and Reid [7] 
(and its complex counterpart ME48) is a general sparse solver which uses Marko- 
witz pivoting. It uses the numerical vahies in the analyse phase and its default 
value for the threshold parameter is 0.1. The code offers a “fast factorization” 
for matrices with exactly the same sparsity pattern as one that has already 
been factorized. One of the ways in which the code achieves high performance, 
particularly on vector or super scalar machines, is by switching to full-matrix 
processing and using Level 3 BLAS [8] once the matrix is sufficiently dense. MA48 
is frequently used as a benchmark against which new sparse solvers are judged. 

The strength of the general approach is that it gives satisfactory perfor- 
mance for many matrix structures and is often the method of choice for very 
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sparse unstructured problems. Some gains and simplifieations are possible if A is 
symmetric. In particular, the Markowitz ordering is replaced by minimum degree 

(k) 

ordering. At the fcth stage the pivot is chosen to be ab , where 



..(fe) 



minr 



(fc) 



and is the number of nonzero entries in row I of If the matrix is 

additionally positive definite, numerical pivoting is not needed. 



3 Frontal Methods 

Frontal methods have their origins in the solution of hnite-clement problems for 
structural analysis, in which the matrix is symmetric and positive dehnite. The 
method can, however, be extended to general unsymmetric systems and need 
not be restricted to finite-element applications (see Duff [9]). 

To describe the method, we assume that A is a sum of finite-element matrices 

m 

A = (3) 

where each element matrix has nonzeros only in a few rows and columns and 
corresponds to the matrix from element 1. The main feature of the frontal method 
is that the contributions are assembled one at a time and the construction 
of the assembled coefficient matrix A is avoided by interleaving assembly and 
elimination operations. An assembly operation is of the form 

aij 4= aij -1- a^j , (4) 

where is the {i,j)th nonzero entry of the element matrix A^’'\ A variable is 
fully summed if it is involved in no further sums of the form (4) and is partially 
summed if it has appeared in at least one of the elements assembled so far but 
is not yet fully summed. The Gaussian elimination operation 

aij 4= aij aij (5) 

may be performed as soon as all the terms in the triple product in (5) are fully 
summed. At any stage during the assembly and elimination processes, the fully 
and partially summed variables are held in a dense matrix, termed the frontal 
matrix. Assuming k variables are fully summed, the frontal matrix F can be 
partitioned in the form 

F = 

where Fn is a square matrix of order fc. Pivots may be chosen from anywhere 
in Til. For symmetric positive-definite systems, they can be taken from the 
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diagonal in order but in the unsymmctric case, pivots must be chosen to satisfy 
a threshold criteria. Assuming k pivots can be chosen, Fn is factorized as Luf/n. 
Then F12 and F21 are updated as 

F21 = F2iUii^ and F12 = L11F12 
and finally the Schur complement 

F22 = F22 — F21F12 

is formed. The factors Ln and Uu, as well as F12 and F21, are stored as parts 
of L and U, before further elements are assembled with the Schur complement 
to form a new frontal matrix. 

The power of frontal schemes comes from the following observations: 

• since the frontal matrix is held as a dense matrix, dense linear algebra kernels 
(in particular, the BLAS) can be used during the numerical factorization, 

• the matrix factors need not be held in main memory, which allows large 
problems to be solved using only modest amounts of high-speed memory. 

There are a number of frontal codes in the Harwell Subroutine Library: MA42 
(Duff and Scott [ 10 ]) and its complex counterpart ME42 are for general unsym- 
metric systems and MA62 (Duff and Scott [ 11 ]) is for .symmetric positive definite 
finite-element problems. Both codes use Level 3 BLAS in the innermost loop 
and optionally store the matrix factors in auxiliary storage. High level BLAS 
are also used in the solve phase, and this increases the efficiency when the codes 
are used to solve for multiple right-hand sides. 

On a machine with fast Level 3 BLAS, the performance of the Harwell frontal 
solvers can be impressive. This is illustrated in Table 2. Here MA42 is used to 
solve a standard finite-element test problem on a single processor of a CRAY 



Table 2. Performance (in Mflop/s) of MA42 on a standard test problem running on a 
CRAY Y-MP. 



Dimension of element grid 


16 X 16 


32 X 32 


48 X 48 


64 X 64 


96 X 96 


Max order frontal matrix 


195 


355 


515 


675 


995 


Total order of problem 


5445 


21125 


47045 


83205 


186245 


Mflop/s 


145 


208 


242 


256 


272 



Y-MP, whose peak performance is 333 Mflop/s and on which the Level 3 BLAS 
matrix-matrix multiply routine SGEMM runs at 313 Mflop/s on sufficiently large 
matrices, ft is important to realize that the Mflop rates given in Table 2 include 
all overheads for holding the factors in auxiliary storage. 

The performance of frontal methods, in terms of both the number of floating- 
point operations and the storage requirements, is dependent upon the order in 
which the elements arc assembled. Many of the proposed algorithms for element 
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ordering arc similar to those for profile reduction of assembled matrices (see, 
for example, Reid and Scott [12]). Routine MC63 by Scott [13] is a new element 
ordering routine in the Harwell Subroutine Library. 

For non-element problems, the frontal method proceeds by assembling the 
rows of the matrix one at a time. In this case, efficiency depends on the ordering 
of the rows. For matrices with a syinmctric or almost symmetric sparsity pattern, 
a prohle reduction algorithm using the sparsity pattern of the Boolean sum of 
the patterns of A and can be used. For highly unsymmetric problems, new 
algorithms for row ordering have recently been introduced by Scott [14], and 
implemented as routine MC62 in the Harwell Subroutine Library. 

4 Multifrontal Methods 

Although high Megaflop rates can be achieved by frontal solvers, there are several 
important dehciences with the method: 

• for some problems, many more flops are performed than are needed by other 
methods 

• the factors can be denser than those produced by other methods 

• there is little scope for parallelism other than that which can be obtained 
within the higher level BLAS. 

These problems can be at least partially overcome through the use of more than 
one front. 

The multiple front approach partitions the underlying “domain” into sub- 
domains, performs a frontal decomposition on each subdomain separately and 
then factorizes the remaining “interface” variables, perhaps by also using a fron- 
tal scheme. The strategy corresponds to a bordered block diagonal ordering of 
the matrix and can be nested. With judicious ordering within each subproblem, 
the amount of work required can be reduced and, since the factorizations of the 
subproblems are independent, there is much scope for parallelism. Preliminary 
results presented by Duff and Scott [15] are encouraging. 

Multifrontal methods are a further extension of the frontal method. In place 
of a small number of frontal matrices corresponding to the number of subdomains 
used, many frontal matrices are used. Each is used for one or more pivot steps, 
and the resulting Schur complement is summed with other Schur complements 
to generate a further frontal matrix. To illustrate this idea, assume A is a sum of 
finite-element matrices (3). Assuming the elements are assembled in the natural 
order 1 , 2 ,..., the frontal method uses the summation 

((..((aW -f A^21) + a[ 31) -t aW) -f ...). 

However, there are other ways in which the summation can be performed. One 
possible alternative is to sum the elements in pairs, and then sum the pairs in 
pairs, and so on 

((AW + A[21) + (Al^l + aW)) + ((Al^l + aI®1) + (AI^I + A^®!)) + ... 
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A judicious ordering of the summation can reduce the work involved in the facto- 
rization and the density of the resulting factors. This added freedom in the way 
in which the assemblies are organized gives the multifrontal method an advan- 
tage over frontal methods. By using a sparsity ordering technique for symmetric 
systems (usually based on minimum degree), the method can be used efficiently 
for any matrix whose sparsity pattern is symmetric or almost symmetric. The 
restriction to nearly symmetric patterns arises because the initial ordering is 
performed using the sparsity pattern oi A . The approach can, however, 
be used on any system. If the matrix is very unsymmetric (that is for many 
entries a^- 7 ^ 0 but aji = 0 ), numerical pivoting in the factorization phase can 
signihcantly perturb the ordering given by the analyse phase. This is much re- 
duced if the matrix is permuted to have a zero-free diagonal before the analyse 
phase and further gains can sometimes be obtained by permuting entries with 
large modulus to the diagonal. For further details, see Duff and Koster [16]. This 
enables the multifrontal method to perform well on a wide range of matrices. 

As in the frontal method, multifrontal methods use dense matrices in the 
innermost loops. There is, however, more data movement than in the frontal 
scheme, and the innermost loops are not so dominant. 

In the Harwell Subroutine Library, MA41 by Amestoy and Duff [17] is a mul- 
tifrontal code for non-element matrices while the MA46 code of Damhaug and 
Reid [18] is designed for element problems. For symmetric problems, routines 
MA27 and MA47 (and complex versions ME27 and ME47) arc available. 

Although sparsity pivoting is usually separated from numerical pivoting, a 
recent variant of the multifrontal approach due to Davis and Duff [19] combines 
the analyse and factorize phases. This approach is implemented as subroutine 
MASS in the Harwell Subroutine Library. 



5 A Comparison of Codes 

The performance of the codes MA42, MA41, MA48, and MASS is compared in Table 3. 
The timings are in seconds on a Sun ULTRA- 1 workstation. MA42 is used with 
the row reordering package MC62. The results show that no single code is clearly 
better than the others. The choice of code is dependent on the problem being 
solved. MA41 generally has the fastest factorize time for problems such as PORES 
3, which have a nearly symmetric structure. For problems that are far from 
symmetric in structure (for example, LHR14C), MASS is very competitive, while 
MA4S performs well when the matrix is very sparse (problem WEST2021). MA42 
has the advantage of requiring much less in-core storage than the other codes 
and this reduction in main memory requirement can mean that it is feasible to 
solve problems with the frontal code which cannot be solved using the other 
codes. 

Finally, we remark that the comparative behaviour of the codes in terms 
of timings is, to some extent, dependent on the computing environment. In 
particular, the performance of MA42 is impressive on vector machines and if out- 
of-core storage is used, its performance is signihcantly affected by the speed 




410 



J.A. Scott 



Table 3. A comparison of HSL codes on unsymmetric assembled problems (Sun 
ULTRA-1). 



Identifier 


Order 


No. of 
entries 


Code 


Factor Factor ops 
time (*10®) 


Storage 
(Kwords) 
In-core Factor 


BAYER04 


20545 


159082 


MA42 


10.1 


123.8 


32 


2333 








MA41 


14.9 


159.0 


2757 


1928 








MA48 


7.8 


15.7 


1130 


926 








MA38 


11.9 


55.7 


1557 


1138 


LHR07C 


7337 


156508 


MA42 


8.0 


48.7 


22 


936 








MA41 


11.2 


151.6 


2416 


1387 








MA48 


11.8 


56.3 


1553 


1253 








MA38 


9.1 


33.9 


1317 


936 


LHR14C 


14270 


307858 


MA42 


17.7 


129.1 


94 


2041 








MA41 


48.1 


315.2 


5025 


3498 








MA48 


24.4 


88.8 


2978 


2399 








MA38 


16.6 


63.5 


2201 


1728 


NNC1374 


1374 


8606 


MA42 


0.52 


5.3 


5 


137 








MA41 


0.58 


9.2 


209 


154 








MA48 


0.63 


5.0 


155 


135 








MA38 


1.03 


5.6 


184 


129 


PORES 3 


532 


3474 


MA42 


0.15 


0.3 


1 


19 








MA41 


0.06 


0.2 


33 


16 








MA48 


0.08 


0.9 


26 


18 








MA38 


0.11 


0.2 


34 


20 


WEST2021 


2021 


7353 


MA42 


0.32 


1.32 


2 


82 








MA41 


0.19 


0.35 


104 


47 








MA48 


0.10 


0.05 


44 


25 








MA38 


0.24 


0.06 


73 


43 



of the i/o. It is important to exploit machine characteristics, such as cache, for 
efficient implementation. Some experiments using other computing environments 
are reported on by Duff and Scott [20]. 
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6 Computing the Inverse of a Sparse Matrix 

Once the LU faetors of a matrix have been computed they ean be used to solve 
for any number of right-hand sides b. Some software, including MA42, allows the 
user to input multiple right-hand sides. BLAS 3 routines can then be exploited 
in the solve phase and this allows a single call with fc > 1 right-hand sides to 
be significantly faster than k calls with a single right-hand side (see Duff and 
Scott [10]). 

An important special case of more than one right-hand side is where the 
inverse is required, since this can be obtained by solving 

AX = I 

by taking columns of / as successive right-hand side vectors. If a sequence of 
problems with the same matrix but different right-hand sides b is to be solved, 
it is tempting to calculate the inverse and u.se it to form the product 

X = A~^b. (6) 

However, there is almost no occasion when it is appropriate to compute the 
inverse in order to solve a linear system. In general, the inverse of a sparse 
matrix is dense, whereas the L and U factors are usually sparse, so using the 
relation (6) can be many times more expensive than using the relation LUx = b. 

There are many applications where only specihed entries of the inverse of 
A are required. For example, the diagonal entries of A^^ may be needed. Since 
solving (6) computes the entries of A~^ a column at a time, the entire lower 
triangle of A~^ would have to be computed to obtain all the diagonal entries. 
This may be avoided using the algorithm of Erisman and Tinncy [21], which 
allows advantage to be taken of sparsity. Let Z = A^^ and suppose a sparse 
factorization of A 



A = LDU 

has been computed, where L and U are unit lower and upper triangular matrices, 
respectively, and D is diagonal. It can be shown that 

Z = D-^L-^ - {I -U)Z 

and 

Z = U^^D-l + Z{I - L). 

Since (/ — L) and (/ — U) are strictly lower and upper triangular matrices, 
respectively, the following relations hold: 

~ [{^ ~ ^ Jj 

^ij ^)]ij ^ ^ ^ A 
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Zii — C?i + [(-/^ — U)Z]ii, 

and 

Zii = dj + [Z{I — Ly\ii. 

Using the sparsity of L and U, these formulae provide a means of computing 
particular entries of Z from previously computed ones. Further details are given 
in the book by Duff, Erisman, and Reid [2]. 

7 Eigenvalue Problems 

Solution methods for large sparse linear systems of equations are important in 
eigenvalue calculations. Large-scale generalized eigenvalue problems of the form 

Ax = XBx (7) 

arise in many application areas, including structural dynamics, quantum chemi- 
stry, and computational fluid dynamics. In many cases, only a few eigenvalues are 
required (for example, the largest or smallest eigenvalues). Solution techniques 
involve iterative methods based on Krylov subspaces. Well-known approaches in- 
clude subspace iteration, the Lanczos method and Arnoldi’s method. The book 
by Saad [22] provides a useful introduction to numerical methods for large-scale 
eigenvalues. 

Before applying a Lanczos or Arnoldi method, we transform (7) into a stan- 
dard eigenvalue problem of the form 

Tx = 9x. 

Iterative eigensolvers rapidly provide approximations to well-separated extremal 
eigenvalues. T should therefore be chosen so that the sought-after eigenvalues 
of (A, B) are transformed to well-separated extremal eigenvalues of T that are 
easily recoverable from the eigenvalues of T. Additionally, because the iterative 
eigensolvers involve matrix-vector products and for large problems, these can 
represent the dominant cost, we need to select T so that y = Tv can be computed 
efficiently. 

A frequently used transformation is the shift-invert transformation 

Tsi{a) = {A-aB)-^B. 

The scalar a is the shift or pole. Because eigenvalues close to a are mapped 
away from the origin while those lying far from a are mapped close to zero, 
Tsi is useful for computing eigenvalues of {A, B) lying close to a. Performing 
matrix-vector products y = Tsiv is eqTiivalent to solving the linear system 



{A - aB)y = b, 



( 8 ) 
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where b = Bv. Thus the efficiency of the eigensolvcr depends on the efficiency 
with which linear systems can be solved. 

In recent years, a number of software packages have been developed for large- 
scale eigenvalue problems, including EB12 and EB13 in the Harwell SnbroTitine 
Library (Duff and Scott [23], Scott [24]) for unsymmetric problems, and the 
very general ARPACK package of Lehoucq, Sorensen, and Yang [25]. These 
codes use a reverse communication interface so that, each time a matrix-vector 
product y = Tv is required, control is returned to the user. This approach 
allows the user to exploit the sparsity and structure of the matrix and to take 
full advantage of parallelism and/or vectorization. Additionally, the user can 
incorporate different preconditioning techniques in a straightforward way. For 
shift-invert transformations, if a direct method of solution is used for the linear 
system (8), the LU factorization of {A — aB) need be performed only once for 
each shift. 

8 Brief Summary 

We have given a brief introduction to direct methods for solving large sparse 
systems of linear equations. Numerical examples have shown that there is no 
single method and no single code that is the best for all applications. The size of 
problem that can be solved using direct methods is constantly growing with the 
development of more sophisticated numerical techniques and advances in compu- 
ter architecture. However, direct methods cannot be used for really large systems 
(particularly those for which the underlying problem is three-dimensional). In 
this case, iterative methods or techniques that combine elements of both direct 
and iterative methods will have to be used. 



9 Availability of Software 

All the codes highlighted in this paper are written in ANSI Fortran 77 and 
all except ARPACK are available through the Harwell Subroutine Library. Any- 
body interested in using any of the HSL codes should contact the HSL Mana- 
ger: Scott Roberts, AEA Technology, Building 477 Harwell, Didcot, Oxfordshire 
0X11 ORA, England, tel. +44 (0) 1235 432682, fax +44 (0) 1235 432023, email 
Scott.Roberts@aeat.co.uk, who will provide licencing information. Academic li- 
cences are available at a nominal cost. Further information may also be found 
on the World Wide Web at http://www.dci.clrc.ac.uk/Activity/HSL. 

The ARPACK code is in the public domain and may be accessed at 
http : //www . caam . r ice . edu/sof tware/ ARPACK/. 

There is a limited amount of sparse matrix software that implements di- 
rect methods available within the public domain. Some codes can be obtained 
through netlib (http://www.netlib.org) and others from the Web pages of 
the researcher developing the code. The problem with the latter source is that, 
in general, there is no guarantee of quality control or of software maintenance 
and user support. 
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Further iiiforination on sources of software for sparse linear systems may be 

found in the recent report of Duff [4] . 
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Abstract. The basic ingredients of a real-space tight-binding linear-mTillin-tin orbital 
(RS-TB-LMTO) approach to non-collinear magnetism and to torque-force calculations 
of the magnetic anisotropy are described. Applications to face-centered-tetragonal Ni 
films epitaxially grown on Cu(lOO) substrates are presented. The tetragonal distortion 
of the films is calculated using an ab-initio local-density technique, and the RS-TB- 
LMTO method is used for calculating the magnetic anisotropy in films with up to 7 
Ni monolayers. The accuracy of the approach allows for a detailed analysis of second- 
and fourth-order anisotropy constants. 



1 Introduction 

The tight-binding linear mufhn-tin orbital method [1] has proven to be a very 
efficient technique for investigating the electronic and magnetic properties of 
complex materials, both in its reciprocal- and real-space forms. Exemplary ap- 
plications include disordered alloys [2,3], metallic glasses [4] and quasicrystalline 
alloys [5]. Of particular interest in the study of magnetism are systems where 
the magnetically ordered ground state cannot be described as a simple ferro-, 
antiferro-, or ferrimagnetic order with all moments aligned parallel or antipar- 
allel to the global axis of magnetisation. In disordered systems the competition 
between ferro- and antiferromagnetic exchange interactions and/or fluctuating 
local anisotropies can lead to the formation of a non-collinear ground-state de- 
scribable as a spin-glass, a spero-, speri-, or asperomagnet [ 6 j. Non-collinear 
magnetic structures can also arise as a consequence of uncompensated magnetic 
interactions in ordered intermetallic compounds. The symmetry criteria for the 
formation of non-collinear spin structures have been discussed in Ref. [7]. 

Techniques for solving the Kohn-Sham equations of local-spin-density theory 
for a non-collinear magnet have been implemented in various standard electronic 
structure codes: the augmented spherical wave (ASW) method [ 8 ], the LMTO 
technique [9,10], and empirical tight-binding [ 11 , 12 ]. Applications include the 
helical magnetic structures of 7 — Fe [9,13] and of YMu 2 [14], the non-collinear 
magnetism in MiisSn [ 8 ], in metallic glasses [3], in quasicrystals [15] and in spin- 
glasses [ 2 ], to cite only a few examples. 

The possibility to tilt the magnetic moment at a given site with respect to its 

equilibrium orientation opens the way to a calculation of Ising-, or Heisenberg- 

type exchange pair interactions, allowing even for a calculation of bilinear and 
H. Dreysse (Ed.): Workshop 1998, LNP 535, pp. 416-«3, 1999. 

© Springer-Verlag Berlin Heidelberg 1999 
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biquadratic exchange couplings [16,17], If spin-orbit coupling is included in the 
Hamiltonian, the magnetic anisotropy energy (MAE) can be calculated by a ma- 
gnetic torque-force approach [18-20], In both cases, the RS-TB-LMTO provides 
sufficient convergence whereas k-space calculations converge only when extre- 
mely fine grids are used for Brillouin-zone intergations [21,22], In addition, they 
have the potential to make the underlying physical mechanism more transparent 
and to allow the investigation of even very complex systems. 

In the present paper we first briefly review the fundamentals of the non- 
collinear spin-polarised RS-TB-LMTO technique and describe its application to 
the calculation of the magnctocrystallinc anisotropy and to the exchange cou- 
pling constants. In the second part we present detailed investigations of the 
magnetic properties of the fee Ni films epitaxially grown on Cu(OOl) surfaces. 
The Ni/Cu(001) system is unique because of the re-entrant character of the 
perpendicular magnetic anisotropy: with increasing film thickness, the magnetic 
anisotropy switches from in-plane to perpendicular at a thickness of about 7 
monolayers, retaining the orientation of the magnetic moments normal to the 
film plane for a thickness of up to 60 rA [23,24], In ’’normal” system the equili- 
brium between the spin-orbit driven anisotropy and the shape anisotropy leads 
to a single transition from perpendicular to in-planc with an increasing num- 
ber of monolayers. It is believed that the re-entrant behaviour of Ni/Cu(001) 
films is largely strain-induced, driven by the lattice mismatch between film and 
substrate. 



2 TB-LMTO Approach and Real-Space Recursion 
Formalism 

Our approach to the self-consistent electronic-structure calculation is based on 
the two-center TB-LMTO Hamiltonian 






Sss' - 



j-a.exch 






( 1 ) 



expressed in terms of the structure constants 5“ and the potential parameters c“, 

which are evaluated in the screened most-localised representation [25], The 
potential parameters depend on the solution of the radial Schrodinger equation 
at the energies chosen usually at the center of the occupied part of the bands. 
Essentially, c“ describes the center of gravity of the bands whereas d°‘ measures 
the band width. The matrix element given by Eq, (1) refers to the interaction 
between atoms i, i' , orbitals I, V of the spin s, s'. The Pauli matrices will be 
denoted as cr’”, and . 

The first term in of Eq, (1) describes the non-magnetic part of the band 
structure, the second spin-dependent term gives rise to the shifts of the bands 
with different spins in the opposite directions. The shift is controlled by the 
exchange splitting field 
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4i,7 = 



Ci4 ~ 



(2) 



Self-consistent calculations for various magnetic systems containing 3d- or 
4d-metals reveal that the proportionality relation 



Ail = hmri , ( 3 ) 

between the exchange splitting field An and the magnetic moment mu is satisfied 
very well for d-orbitals [4,26,27]. Therefore the non-selfconsistent studies based 
on the TB-LMTO formalism can use a fixed Stoner parameter of /2 = 0.95 
eV / as a very reasonable approximation. 

Till now it was supposed that the magnetic moments arc all aligned along 
the z axis. The generalization of the presented approach to the treatment of 
a non-collinear magnetic order consists in rewriting the exchange part of the 
Hamiltonian (1) to a rotation invariant form 

^iisu^vs' = -]^5iu5u'Auass' = -]^5u'5wIuAu'n.^.(Jsa' , (4) 

where <t = u® x + a*' y + cr^ z is the vector of the Pauli matrices with x, y, 
z the unit vectors spanning a global coordinate space and rij = mi/jmij. Each 
magnetic moment direction defined by polar angles ipi and %Ji with respect to 
the global coordinate system defines the moments’ local coordinate system, in 
which the exchange part of the Hamiltonian keeps the form of Eq. (1) but with 
= erf referring to the local coordinate system. Because the paramagnetic part 
of the Hamiltonian is constructed in the global coordinate system, the on-site 
exchange part must be transformed correspondingly for each atom. Of course, the 
opposite procedure of the transformation of the paramagnetic matrix elements, 
namely the structure constants into the local bases would be equivalent. Taking 
rii = cos ifi sin x + sin tpi sin i?! y + cos di z we obtain for the transformed 
exchange splitting field on the ?'th site 



c^^iiAiiHicr — ^liiAii 
1 



cos 79 i sin 79 i exp (—iipi) 

sin 79i exp (i^i) —cos79i 



= -^IuAuB((fii,79,)crfD^((fii,79i) 



( 5 ) 



D{(pi,'di) is the Wigner s = | rotation matrix from the local coordination 
system to the global one 



( cos ^ exp 
sin ^ exp 



— sm exp 
cos exp 







( 6 ) 
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The band-structure problem is solved using the real-space recursion method 
[28], which is very efficient in combination with the TB-LMTO Hamiltonian. 
Therefore complex systems with many degrees of freedom can be treated. 

The determination of a non-collinear magnetic structure proceeds as follows. 
In the first step the ground state of the collinear magnetic arrangement is found 
for the Hamiltonian (1). The potential parameters c“, together with the Sto- 
ner parameters la calculated from Eqs. (2) and (3) are used in the construction 
of the Hamiltonian for the non-collinear calculation with the exchange part due 
to Eq. (5), so that all the procedure is parameter-free. The starting magnetic mo- 
ment vectors arc distributed randomly or small random transversal components 
are added to the magnetic moments resulting from previous collinear calculation. 
Partial densities of states are obtained for the x-, y-, z-directions of a moment 
in its local basis by choosing as the starting recursion vector the normalized 
eigenvector of cr“, cr^, cr^, respectively, for up and down spin directions. The 
starting recursion vector is then rotated to the global coordinate system by mul- 
tiplying it with D[ipi,'di) from Eq. (6). In general, the new magnetic moments 
obtained from the integrated projected densities of states will have transversal 
components with respect to the last moment directions. The partial densities 
of state along the direction of the magnetic moments arc used in the update of 
the charge densities, potential parameters and exchange splitting helds at each 
step. The process continues in an iterative way until the transversal components 
are sufficiently small. The moment rotations are quite slow during the iteration 
process, therefore the new orientations are extrapolated from the old and new 
directions and besides random noise components are added in order to avoid 
running into nearest local minima. In the prediction of the new non-collinear 
structure we use a Broyden mixing scheme [29]. For further technical details of 
the non-collinear calculations we refer to the papers [10,19,30]. 

In some cases symmetry restrictions allow only a few special spin arrange- 
ments in a system. Then the aim is to find a ground state spin configuration. 
This kind of calculations can be done as described above but only the densities of 
state projected along the moments are necessary what results in the much faster 
calculation. To the group of models with hxed directions of magnetic moments 
belongs a determination of the magnetic anisotropy energy. 

2.1 Magnetocrystalline Anisotropy 

The spin-orbit coupling responsible for the magnetocrystalline anisotropy can be 
included into Hamiltonian given by Eq. (1) and Eq. (4) by adding an intra-atomic 
term 



HI 



ls,i'l'a' ~ ^^W^ils,i'l's'{E) Us^i' I' s' 



(7) 



The matrix elements | for the d-orbitals in the frame rotated 

to the magnetic moment direction can be found in Ref. [31]. The spin-orbit 
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Table 1. Spin-orbit coupling parameters in meV for iron, cobalt and nickel calculated 
at their experimental lattice constants for d-bands at the Fermi level Ef and at the 
middle of the occupied parts of the d-bands t^,. The results obtained by LSDA exchange- 
correlation potentials are compared with those obtained by GGA exchange-correlation 
potentials (in parentheses). 





id-riEp) 


idiiEp) 


5dt(e./) 




Fe 


72(72) 


56(55) 


58(58) 


45(45) 


Go 


94(94) 


78(78) 


73(74) 


63(63) 


Ni 


109(111) 


103(103) 


91(88) 


84(84) 



parameters !^as,i'i's'{E) are assumed to be non-iiegligible only between the d- 
orbitals centered on the same site and they are given in Ry units [32] as 



Here c is the velocity of light, 4>us{E, r) are the radial partial waves calculated 
at the energy E and V (r) is the onc-clcctron interaction potential. The spin-orbit 
coupling parameters obtained for iron, cobalt and nickel are shown in Table 1. We 
have found that using the Barth- Hedin-Janak local-spin-density approximation 
(LSDA) [33,34] and the generalised gradient approximation (GGA) [35] results 
for the spin-orbit coupling differing less than 3 %. 

Because the spin-orbit coupling constants are much smaller than the band 
width for 3d-metals the magnetic anisotropy energy is often evaluated as a diffe- 
rence of the sums of the single-particle eigenvalues for the opposite spin directions 
treating the spin-orbit term (7) as a perturbation. Even then the calculations 
in the k-spacc are very laborious [21]. Recently it has been demonstrated in 
several papers that the real-space approach makes the task of the MAE estima- 
tion possible in a non-perturbative fashion from the total ground state energies 
[19,20,36]. From practical reasons the inclusion of the spin-orbit coupling in the 
non-collinear calculations has the advantage of reducing somewhat the drift of 
the overall magnetic moment in the course of the iteration process. 

The determination of the preferential magnetisation orientation is of much 
interest especially for thin magnetic films and multilayers, which possess a lowe- 
red symmetry. However, in the layered systems the other significant contribution, 
the magnetostatic shape anisotropy coming from the dipole-dipole interaction, 
must be taken into account. Because the shape anisotropy always prefers the 
in-plane magnetisation, it is responsible for the changing the orientation of the 
magnetisation to the plane at some critical thickness if the spin-orbit contri- 
bution to the MAE happens to support a perpendicular anisotropy. When the 
thickness of the magnetic film is reduced to a few monolayers, the contributions 
from all discrete dipole pairs have to be summed up explicitly 
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^dip 



1 







( 9 ) 



rather than to resort to the continuum approximation. The sums appearing in 
Eq. (9) converge slowly due to the long-range character of the dipole-dipole 
interaction but they can be efficiently evaluated in the reciprocal space [37]. For 
the cubic sc, bcc and fee lattices with one atom type per layer the magnetostatic 
dipolar energy can be expressed in Ry units as 



Edip 



1 



E 

<i,3> 



rriirrij (cos cos 'dj — 



- - cos 



) sin r?, sin t? j ) Mij 



(10) 



where U 2 d means a number of atoms in one layer, a is the lattice parameter of 
the basic cubic cell and the Mij are the dipolar Madelung constants tabulated 
in Table 2. 



2.2 Exchange coupling constants 

The modern spin-polarised band theory gives an accurate description of the ma- 
gnetic ground state of most metals and alloys. The magnetic excitations from the 
ground state are described in terms of various spin models in which the strength 
of a pair interaction is controlled by a magnitude of the exchange coupling. 

Recently we have derived expressions for the exchange pair coupling con- 
stants and some other related quantities within a real-space approach [16]. The 
exchange interaction between the ith and jth moments takes a form 

= l^G]]{E)G]\{E)dE , (11) 



Table 2. Dipolar Madelung constants for the sc, bcc and fee lattice geometries provided 
the basic cubic cell of unit volume, z stands for the interlayer distance. 



layer 


z 


SC 


bcc 


fee 


0 


0 


9.03362 


9.03362 


25.55094 


1 


1 

2 


— 


4.17639 


4.04301 


2 


1 


-0.32746 


-0.32746 


-0.06402 


3 


3 

2 


— 


0.01238 


0.00072 


4 


2 


-0.00055 


-0.00055 


0.00001 




422 Daniel Spisak and Jurgen Hafner 



where A is the exchange splitting field defined by Eq. (2) and the off-site Green’s 
functions Gfj{E) for the up and down spins are calculated by the recursion 
method using the bonding and antibonding combinations for the sites i and j. 

The mean-field estimation of the Curie temperature is related to the on-site 
exchange coupling Ju via 



Tc,i = -7^ . (12) 

The critical temperature Tc,i should be viewed as a measure of the local stability 
of the magnetic state of the ith atom surrounded by all other atoms. 

Another quantity related to the exchange couplings and accessible to the 
experimental verification is the spin-wave stiffness constant 



Hi — . (13) 

3 

Here Vij are the lattice vectors. The results for the nearest and the next nearest 
exchange couplings, the Curie temperature and the stiffness constants for iron, 
cobalt and nickel arc presented in Table 3. The stiffness constant for iron was 
calculated using the first 14 terms, for cobalt and for nickel the first 17 terms in 
Eq. (13) were taken into account. The overall agreement with the experimental 
data confirms that the spin models formulated originally for systems with loca- 
lised magnetic moments can be still be used as a reasonable approximation for 
itinerant magnets. 

3 Ni/Cu(001) Films 

3.1 Atomic Structure 

Recently we have witnessed extensive experimental [24,42-47] and theoretical 
investigations [48-50] of the Ni films grown on Cu. The Ni/(001)Cu system has 
an average lattice mismatch of only 2.6 % favoring a coherent growth of Ni on 



Table 3. The nearest and the next nearest exchange couplings Ji, J 2 , the experimental 
and the calculated values of Curie temperature Tc and the spin-wave stiffness constant 
D for iron, cobalt and nickel. 





Ji (meV) J 2 


(meV) Tc 


. (K) 


(K) D (meVrA^) 


(meVrA^) 


Fe (bcc) 


16.27 


17.29 


890 


1044“ 


280 


280*' 


Co (hep) 


25.05 


4.11 


1000 {p Co) 1388= 


1900 


580*' 


Ni (fee) 


4.62 


0.20 


290 


627“ 


530 


555“* 


“ Ref. [38], 


, Ref. [39], " 


Ref. [40], 


Ref. 


[41] 
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Table 4. The relaxed atomic structures of films with one to seven Ni layers on Cu(OOl) 
from first-principles calculations using the VASP package. The results are compared 
with the available experimental data taken at room temperature. 



ML 


1 


2 




3 




5 




7 


fct Ni 


dl2(%) 


-6.2 -7.0 


-11.9 -2.4 


-11.4 


-3.5 


-1.1 


-10.8 


1.6 


-11.0 


-11.1 


d2a(%) 


0.2 -1.0 


-1.9 -1.0 


-7.4 


-7.5 


-3.9 


-5.1 


-6.1 


-5.8 


-6.2 


di4,{%) 




0.5 -0.5 


-3.2 


0.4 


-2.8 


-5.8 


-5.0 


-5.9 


-6.9 








0.0 


6.6 


-1.1 


-6.8 


-5.6 


-5.3 


-7.0 


dh&{%) 












-1.2 




-6.0 




deriVo) 












1.7 




-6.2 




dTs{%) 
















-0.7 




ds9{%) 
















3.0 




Ref. 


[45] 


[45] 




[45] 


[46] 




[46] 







Cu in an artificial tetragonally distorted face centered cubic (fct) structure up 
to a thickness of about 40 layers above which the growth continues in the fee 
structure. 

We investigated first the relaxation of films with one to seven Ni layers on 
Cu(OOl) using the spin-polarised version of the Vienna ab-initio simulation pack- 
age (VASP). A detailed description of the VASP and its algorithms can be found 
in Ref. [51]. We used the Ceperley and Alder [52] local spin-density functional 
and the generalised gradient approximation corrections [35] to the exchange- 
correlation energy. 

In the first step of the calculation the equilibrium lattice spacing of bulk Cu 
was found 3.637 rA, in close agreement with the measured value 3.61 rA. Then 
we performed a geometry optimisation of the Cu surface for a slab with 6 layers. 
We found a 3.6 % inward relaxation of the surface layer. The predicted relaxation 
agrees well with previous ab-initio calculations [53], but is somewhat lower than 
the relaxation found in experiment (1.2 % in Ref. [54], 2.4 % in Ref. [55]). The 
difference is mostly due to the fact that experiments have been performed at 
room temperature. Afterwards the slab was extended on one side with 1 to 7 
Ni layers and during the relaxation the lateral lattice spacing was kept at the 
Cu bulk value. The inspection of the obtained layer relaxations summarised 
in Table 4 reveals clear trends. The surface layer undergoes a strong inward 
relaxation, the subsurface layers form a fct lattice with an axial ratio c/aw 0.94. 
The relaxation of the bulk fct Ni with the lattice spacing of the Cu bulk leads 
to a tetragonal distortion of c/a w 0.93. The rightmost column of Table 4 gives 
the structure of the fct Ni surface modeled as a slab of eight layers. Again the 
surface layer relaxes inward by a 11 % and the c/a ratio reaches the value 0.93 
in the middle of the slab. The estimate within the continuum elasticity theory 
of coherent epitaxy-induced structural changes gives c/a w 0.965. 
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Table 5. Magnetic moments in relaxed Ni/Cu(001) films with up to seven monolayers 
of Ni and of fct Ni in hb- First two interface Cu layers are shown as well. The results 
within RS-TB-LMTO approach can be compared with the moments obtained by VASP 
package (right columns). 



ML 


1 




2 




3 




5 




7 




fct Ni 


1 


0.12 


0.38 


0.76 


0.75 


0.78 


0.74 


0.77 


0.74 


0.74 


0.73 




2 


0.00 


0.01 


0.48 


0.52 


0.55 


0.59 


0.58 


0.68 


0.58 


0.68 




3 


-0.00 


-0.01 


-0.01 


-0.01 


0.50 


0.47 


0.69 


0.71 


0.68 


0.69 




4 






-0.00 


0.00 


-0.01 


0.00 


0.62 


0.69 


0.64 


0.68 




5 










-0.01 


0.00 


0.48 


0.58 


0.68 


0.71 




6 














-0.01 


0.01 


0.64 


0.70 




7 














-0.01 


0.00 


0.47 


0.58 




8 


















-0.01 


-0.01 




9 


















-0.01 


-0.01 




m 


0.12 


0.38 


0.62 


0.63 


0.61 


0.60 


0.63 


0.68 


0.63 


0.68 


0.66 0.61 



For a single monolayer of Xi/Cu(001), the predicted relaxation is in very good 
agreement with low-energy electron-diffraction (LEED) experiments by Kim et 
al. [45]. For thicker layers, however, LEED experiments predict only a minimal 
inward relaxation (and for yet thicker layers even an outward relaxation) of the 
top layer [45,46], at an almost homogeneous tetragonal distortion of the deeper 
part of the film. The first-principle calculations, on the other side predict a large 
inward relaxation ot the toplayer, but agree with experiment concerning the 
tetragonal distortion of the interior of the film. 

Recent ab-initio calculations (based on the same technique) of the structu- 
ral, electronic and magnetic properties of all low index surfaces of Ni [56] lead 
to excellent agreement with experiment. Similar discrepancies between ab-initio 
calculation and the experiment as those observed for Ni/Cu(001) have been re- 
corded for a number of transition metal surfaces, e.g. Rh(OOl) and attributed to 
an anomalously large perpendicular thermal expansion at the surface [57], limit- 
ing the comparison between the room-temperature experiment and the T = 0 K 
calculation. It must be left to the future work whether this mechanism also ex- 
plains the discrepancy between calculations and experiment for the surface of 
Ni/Cu(001) films. 

3.2 Magnetic Structure 

Using the relaxed atomic structural models discussed in the previous paragraph 
we have calculated the magnetic structure and the magnetic anisotropy energy 
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Table 6. Spin-orbit coupling parameters for up (left columns) and down (right co- 
lumns) partial waves in the relaxed Ni/Cu(001) films with up to seven layers of Ni in 
meV. The results for bulk fct Ni are shown in the right-most column. 



ML 1 2 3 5 7 fct Ni 



1 


97.3 95.9 107.2 


98.3 


106.8 


97.7 


106.6 


97.6 


106.5 


97.8 


2 


105.7 


102.6 


111.6 


104.5 


111.3 


103.8 


111.8 


104.3 


3 






107.1 


101.0 


111.1 


102.3 


111.4 


102.7 


4 










110.8 


102.9 


110.7 


102.6 


5 










106.6 


100.7 


111.3 


102.7 


6 














111.4 


103.2 


7 














106.7 


100.7 



110.2 101.9 



of the Ni/Cu(001) films within the framework of RS-TB-LMTO method. The 
real-space recursion technique was applied to cells with 1944 atoms for 1 Ni 
monolayer (ML) up to 2560 atoms for 7 Ni ML. Periodic boundary conditions in 
the lateral directions and the free boundary conditions normal to the layers were 
used. In all cases 20, 20, and 50 recursion levels were used for the s-, p-, d-orbitals, 
respectively. Because for the thicker Ni films we obtained a systematic inward 
relaxation 6 %, we studied also an infinite fct Ni crystal with the tetragonal 
distortion c/a = 0.94. 

The layer-resolved and the average magnetic moments are shown in Table 
5. As can be expected, the moments at the surface are enhanced (except for the 
monolayer) while the moments at the interface drop. This behaviour is observed 
independent of the film thickness. Although the total average magnetic moment 
approaches the bulk value gradually, even the system with 7 Ni ML is influenced 
so strongly by the surface that it is not possible to identify a subsurface region 
with steady bulk-like magnetic moments. We also note that the RS-TB-LMTO 
calculations agree well with k-space results obtained using VASP. 

Due to the relatively low Curie temperature of bulk Ni, the thin films of 
Ni can be studied in a wide temperature range as a function of film thickness. 
The comprehensive collection of experimental data establishes a clear picture 
of magnetic anisotropies in Ni/Cu(001) hlms [24,43,44]. As the film thickness 
increases the magnetisation changes its direction from [100] to [110] between 6 
and 7 Ni ML and between 7 and 8 Ni ML it switches continuously to the [001] 
directioir. The surface and volume contributions to the second-order anisotropy 
constants AT| and s-t zero temperature lie between -100 up to -180 p-eV/atom 
and 40 up to 75 peV/atom [44], respectively. The large error bars are due to 
the uncertainty in the extrapolation down to zero temperature. For the fourth- 
order in-plane and out-of-plane anisotropy constants very small values of « 
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— 1 /rcV/atoin and w 0.2 /xeV/atom at T/Tj. = 0.8 were reported for 7 Ni 
ML [44]. 

We calculated MAE using the force theorem [21], treating the spin-orbit 
coupling as a perturbation. As can be seen from Table 6 the spin-orbit coupling 
parameters are only slightly affected by the surface and interface. The layer- 
resolved and the total band and dipole-dipole contributions to the MAE are 
compiled in Table 7. The contributions of the surface layer to the MAE in all 
but monolayer films support an out-of-plane orientation of the magnetisation 
and the magnitude of these contributions saturates with the film thickness. The 
contribution of the Ni layer at the interface to the Cu substrate also prefers a 
perpendicular orientation of the magnetic moments (except for the 2 ML him), 
but does not show a systematic variation as the number of layers increases. Quite 
surprisingly, we also hnd a large contribution from the subsurface layer and the 
second Ni layer from interface, always preferring an in-plane orientation. The 
contributions from the inner layers in films with > 5 ML are always smaller than 
the surface and interface contributions. As can be seen from Fig. 1, the 2 ML and 
3 ML hlms show the perpendicular magnetic anisotropy, all other hlms we have 
investigated have an easy axis in the plane. The spin-reorientation transition 
between the 3 ML and 5 ML case is driven by the subsurface contribution to the 
MAE. For the Ni monolayer wc predict K 2 = —263 ^ueV /atom in a fair agreement 
with the measured value about —157 p,eV/atom [43], previous calculations giving 
—94 /reV/atom [48] or —690 /xeV/atom [50]. For 2 Ni ML our result K 2 = 




Ni layers 



Fig. 1. Calcutated total magnetic anisotropy energy (circles) and its dipole-dipole 
(triangles) and band (squares) contributions for thin Ni films on Cu(OOl). 
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Table 7. Spin-orbit (AA’b, left columns) and dipolar (AA'd, right columns) contribu- 
tions to the magnetic anisotropy energy AE = Eioo ~ Eooi in the relaxed Ni/Cu(001) 
films with up to seven Ni layers. In the last row the estimate of the fourth-order ani- 
sotropy constant = 4(Aiio — Eioo) is given. The values for the layer-resolved 
contributions are in /reV, the averaged values are in /reV/atom. 



ML 


1 




2 


3 




5 




7 




fct Ni 


1 


-263 


0 


822 -14 


1511 


-15 


2046 


-15 


2202 


-14 




2 






-369 -7 


-2112 


-10 


-1036 


-11 


-1049 


-11 




3 








659 


-6 


355 


-14 


254 


-13 




4 












-1870 


-11 


-641 


-12 




5 












168 


-6 


-316 


-13 




6 
















-1330 


-12 




7 
















481 


-6 




Ai?b,d 


-263 


0 


227 -10 


19 


-10 


-67 


-11 


-57 


-12 


52 0 


A(i?b -l- Ed) 


-263 




217 


9 




-78 




-69 




52 


A411 


-331 




103 


24 




-17 




-6 




-3 



227 /reV/atom compares very well with K 2 = 300 /reV/atom obtained by Wu 
and Freeman [50] . We note that the earlier calculations considered unrelaxed fct 
or fee lattices. 

For an infinite fct Ni crystal with the tetragonal distortion c/a = 0.94 we 
performed a series of calculations for models with different number of atoms. 
We obtained almost the same values for the MAE 50, 54, 52 /reV/atom for cells 
with 2048, 2916 and 6912 atoms, respectively. These values are only a bit lower 
than results 60 fieV [49] and 65 /ueV [50] found in the k-space calculations. 

The decreasing negative values of the MAE for 5 and 7 ML, together with 
the positive value for the infinite fct Ni indicate the possibility of an in-plane to 
perpendicular reorientation at a thickness > 7 ML. However, it must be left to 
future studies to locate this transition precisely. 

The MAE for 2 Ni ML as a function of the tilt angle i? taken from the [001] 
direction is shown in Fig 2. The dependence on cos^ d is almost linear. Our 
attempt to estimate the higher-order term yields a ratio between the fourth- 
order and the second-order contribution of 0.01. Here and below we use for the 
angular dependence of the MAE the expression [44] 

■&) = Eo — K2 cos^ 'd K 4 ±cos‘^'& ^'411(3 + cos 4 (/j) sin^ d . ( 14 ) 

2 8 

Whether the reorientation of the magnetisation between the in-plane and out- 
of-plane orientations happens continuously or abruptly depends on the sign of 
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Fig. 2. Band energy difference plotted as a function of d for 2 Ni ML films on Cu(OOl). 
The solid line represents the fit by a parabola. 



^ 411 . For a stabilisation of a tilted magnetisation (and hence a continuous second 
or higher order spin-reorientation transition) a negative is needed. The value 
of the fourth-order in-plane anisotropy constant can be obtained from the 
variation of the MAE with cos 4(^9. In the last row of Table 7 we show our results 
for A' 4 ||. It can be concluded that its thickness dependence is rather complex and 
the 7^411 changes sign between 3 and 5 ML. Because these values are typically as 
small as few /reV we carried out additional calculations for several intermediate 
angles between 0 and 45° for 2, 3 and 7 ML films (Fig. 3). From the scatter of 
the points around a linear fit versus cos Aip we estimate that the confidence level 
in the numerical accuracy of our approach is better than 0.3 /xeV/atom in all 
cases. Actually, the curves for 2 and 3 ML seem to be modulated systematically. 
Despite of the exceedingly small values of the fourth-order MAE the calculated 
values are in a reasonable agreement with experimentally observed trends. 

In order to understand the influence of the surface and of the tetragonal 
distortion of the films on the magnetic anisotropy, a correlation between the 
number of holes in the Ni-d band (as observable in near-edge x-ray-absorption 
fine-structure (NEXAFS) experiments) and possible anisotropy of the d-band 
occupation has been evoked [47]. It has been argued that the number of holes in 
the Ni-d band is strongly reduced in the thinnest films, converging to a bulk-like 
value at a thickness of about 5 ML. In addition, an in-plane character of the 
d-holes irrespective of the thickness has been reported. 





E(0) - E(cp) (|j.eV/atom) 
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-1,0 -0,5 0,0 0,5 1,0 
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Fig. 3. Band energy difference plotted as a function of ip for 2, 3 and 7 Ni ML films 
on Cu(OOl). 



In Tabic 8 wc present the number of 3d-holcs (unoccupied states) separated 
into holes in in-planc orbitals {xy, for the (001) plane) and out-of-planc 

orbitals {yz, zx and for the (001) plane). It is obvious that the 3d- 
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Table 8. Numbers of 3d-holes with the in-plane (d||, left columns) and out-of-plane 
{d±, right columns) symmetry for Ni atoms in one to seven Ni/Cu(001) films. The ratio 
d|l/d± determines the character of the holes (dy/du >2/3 means in-plane character, 
d\i/d± < 2/3 means out-of-plane character). The results for bulk fct Ni are shown in 
the rightmost column. 



ML 




1 


2 




3 




5 






7 


fct Ni 


1 


0.73 


0.62 


0.70 


0.74 


0.67 


0.77 


0.68 


0.76 


0.66 


0.78 




2 






0.53 


0.88 


0.51 


0.93 


0.51 


0.93 


0.51 


0.93 




5 










0.59 


0.86 


0.56 


0.92 


0.56 


0.93 




6 














0.55 


0.92 


0.56 


0.92 




7 














0.58 


0.85 


0.56 


0.92 






















0.54 


0.94 






















0.57 


0.86 




d\\,± 


0.73 


0.62 


0.61 


0.81 


0.59 


0.85 


0.58 


0.88 


0.57 


0.90 


0.55 0.91 


d\\ H" dj_ 


1.35 




1.42 




1.44 




1.45 




1.46 




1.46 


d\\/d± 


1.17 




0.76 




0.69 




0.66 




0.63 




0.61 



band filling is reduced progressively as the film thickness increases and at the 
same time the hole character changes from the in-plane to out-of-plane between 

4 and 5 Ni ML. The increase of the number of Ni holes with increasing film 
thickness compares well with the experimental observations reported in Ref. [47]. 
In addition we find that the ratio d\\/d± is enhanced in the surface layer over the 
value d||/dy =2/3 corresponding to an isotropic distribution of the 3d-holes. 
Again this agrees with the conclusions derived from the NEXAFS experiments 
where the anisotropy has been attributed to the tetragonal distortion of the films. 
However, whether this conjecture is correct remains to be verified by reference 
calculations for undistorted films. The in-plane character of the 3d-holes means 
at the same time that the 3d-electrons have perpendicular character, and this 
agrees with the positive contributions of the surface layers to the MAE. In the 
deeper layers, the ratio d\\/d± drops below 2/3 and is smallest in the subsurface 
layer and the second layer from the interface. Again this correlates well with the 
negative contributions to the MAE noted for these layers. Taking the average 
over the entire him, we hnd that the hole character changes between 3 and 

5 ML what correlates with the reversal of the MAE. Altogether this analysis 
demonstrates that there are important changes in the partial electronic density 
of states near the Fermi level as a function of the him thickness whose evident 
correlations to the MAE deserve further investigation. 
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4 Conclusions 

The RS-TB-LMTO formalism described in the first part of the paper, together 
with an ab-initio density functional approach to the reconstruction of the films, 
has been applied to study the variation of the magnetic anisotropy of Ni/Cu(001) 
films with increasing film thickness. The predicted tetragonal distortion of the 
deeper layers agrees with experimental observations, but there is disagreement 
concerning the obtained inward relaxation of the top layer - this is possibly 
related to the confrontation of the T = 0 K calculations with room-temperature 
experiments. 

For the magnetic anisotropy, we predict a very complex behaviour: the change 
from in-plane (1 ML) to perpendicular (2, 3 ML) back to in-plane(5, 7 ML) and 
eventually again back to perpendicular for thicker layers (as long as the film re- 
mains tetragonally distorted). A detailed analysis reveals a competition between 
surface and interface contribTitions favouring a perpendicular orientation and 
subsurface and subinterface contributions favouring in-plane orientation of the 
magnetic moments. The correlations to a changing anisotropic population of the 
Ni-3d bands have been investigated and found to agree with the interpretation 
of NEXAFS experiments. 
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Abstract. In this paper some problems experienced during studies combining real 
space and tight binding methods are addressed. These methods have been mainly 
used for studying the magnetic properties of thin films deposited on substrates and 
of multilayers taking into account interfacial imperfections. This paper is illustrated 
with calculations of the electronic structure of Fe/Cr multilayered systems which are 
particularly interesting. First, the use of d and spd tight binding parameterisations of 
the electronic structure for transition metals and its relation to the recursion technique 
is discussed. Second, some advantages of using real space cells for studying complex 
systems are presented. Finally, the application of these methods for systems presenting 
non-collinear magnetism is discussed. 



1 Introduction 

During the last ten years, the electronic structure of large and complex metallic 
systems has been extensively studied mainly due to the enhancement of the 
computer facilities. Powerful computers with large memories became available 
allowing to reach rapidly self consistency in the band structure calculations for 
cells containing up to a few hundred of heavy atoms. One of these kinds of 
systems, concerns the metallic multilayers presenting new magnetic properties 
like the Interlayer Magnetic Coupling (IMC) or the Giant Magneto Resistance 
(GMR) effect particularly interesting for applications. This paper deals with the 
use of the real space recursion technique for the study of the magnetic order in 
such metallic multilayers. 

The multilayered system built by alternating a m monolayers thick 

A layer with a n monolayers thick B layer consists in a long elemental chemical 
cell containing, in the simplest case, one non equivalent atom in the in plane 
cell and m + n atoms in the growth direction perpendicular to the plane of 
the layers. Because these multilayers are periodic in the 3 directions of space, 
the band structure is usually calculated in the k space of the reciprocal lattice. 
However, since the aim of more complete studies is usually to determine the 
magnetic properties for thin overlayers during the growth of the multilayer, to 
include interfacial imperfections, to relate the growth mode and the magnetic 
behaviour, ... a real space technique is used in order to have the possibility to 
calculate the electronic structure of all these situations with the same method. 

The aim of this paper is to discuss possible problems and solutions used 
H. Dreysse (Ed.): Workshop 1998, LNP 535, pp. 434-458, 1999. 
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during various studies of the Fe/Cr multilayered system. A d restricted tight 
binding modelling of the band structure has been first used mainly because 
it needs less computer time than a full spd description and allows to use an 
exact cluster for the recursion technique. More recently, due to physical lacks 
in some results, the description has been extended taking the spd hybridisation 
into account. However, in order to reduce the computer time, clusters smaller 
than the exact one are used for the calculation. In the first section, some possible 
choices for the shape of these “inexact” clusters are given and their use for large 
cells like the one of multilayers is discussed. In the second section, the advantages 
of using real space cells for studying such complex systems are discussed. Finally, 
in the last section, the application of these methods for systems presenting non- 
collinear magnetism is presented. 



2 Tight Binding Parameterisation 
and Reenrsion Techniqne 



2.1 The Recursion Technique 

This method, proposed by Haydock and Heine [1-3], is well suited for the deter- 
mination of the electronic structure when (i) the Hamiltonian H can be expressed 
in a finite basis of localized orbital |i,A) of symmetry A on the site i (like the 
one considered in the next subsection) and (ii) when the knowledge of the Green 
function Gi^\{z) = {i, X\G{z)\i, X) = {i, X\{z — X) elements is sufficient 

for the calculation of the band structure. For example, this method applies to 
situations presenting no symmetry like amorphous or disordered systems, aro- 
und impurities or structural imperfections. 

For each given site and symmetry (i, A), a new basis |n} is built in order 
to have a tridiagonal matrix for the representation of H in this new basis. The 
basis \n} is recursively obtained starting from the |i,A) basis function with the 
following expressions: 



|0} =\i,X) 

11} =H|0}-al’>} (1) 

|n -f 1} = H\n} - alll^\n} - bh^\n - 1}. 

The sets (ah^, bh^) are called the recursion coefficients. They are easily obtained 
by calculations of simple scalar products: 

i,\ _ {n\H\n} 

“"+1 {n\n} 

,x ^ {n\H\n+ 1} ^ {n+ l|n-H} 

{n|n} {n|n} 

The orthonormalized recursion basis is then obtained by: 



|n) 



|n} 

^/\n\n} 



(3) 
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and the recursion coefficients correspond to: 

= {n\H\n) 

\/^n+i = {n\H\n+l) = {n+l\H\n). (4) 

The desired tridiagonal matrix representation of H is consequently obtained: 




V 0 •••/ 



Since |i, A) = |0), the Green function is equal to (0|(2: — H) ^|0) which 

corresponds to the first clement [{z — H)' ^]oo of the inverse matrix of {z — 
H). This particular element is easily obtained by considering the determinants 
det(z — Hn) = \\z — Hn\\ where Hn is the part of the H matrix limited to the 
elements {|n), \n + l),\n + 2), ...} of the recursion basis, 



/ aJ , 



= 



n-\-l 



i.X 

^n+l 



0 \ 



T '<■ , A i,\ 

®n+l ®n+2 



i,X 

G+2 



ji-X i,X 
'^n+2 ®n+3 



i,X 

’n+3 



i,X i.X 
^n+3 “n+4 



V 0 

The desired Green function is then equal to 






= 



l-z - -f^o 



\z-H- 



1 



i,X\ 



[z - a{'}\\z - Hi\ \ - bi^\\z - H 2 \ 
1 



7 

2 m Wz-HA 



corresponding to the continuous fraction expansion 

1 



G,,x{z) = 



i,X 



z — a^ — 



t.A 



i,X 



( 6 ) 



( 7 ) 



( 8 ) 
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An analytical expression of the Green function is then obtained. However, an 
exact determination of Gi^\{z) requires an infinite number of recursion coefh- 
cients which is never the case in practical calculations. Usually, only a few levels 
{N pairs of {an, bn) coefficients) of the continuous fraction are determined and 
the missing part of the fraction expansion is replaced by a terminator function 
Si^\{z). The continuous fraction expansion becomes 



G^A^) 



i,\ 




(9) 



,i,X 

M. 



(jj 



For energy bands presenting no gap, the recursion coefficients have asymptotic 
limits corresponding to (a^, bA)- The most easy way to determine is to 

assume that for n > N the recursion coefficients are equal to their asymptotic 
values. The terminator function is then equal to: 

^i,\{z) = z 



A,\ 



?-,A 

Z Ojoc 



z — aA — 



= 2: 









1)1. 

00 






(10) 



The solution of this equation gives the square root expression of the terminator 
function: 






z — a. 



i.X 




(11) 



The Beer-Pettifor method is used [4] to determine (a)^,6)^) which is based 
on the calculation of the band limits (cmin, Cmax) given by (aA ~ 2\/bA, a^ + 
2'/hA)- When the continuous fraction is truncated at the level N, the projected 
density of states (PDOS) 



riz,A) = --Im(G,,A(e + *0)) (12) 

7T 

corresponds to a sum of Dirac functions and the band limits can be identified by 
the energies of the lowest and highest Dirac functions. This method is extremely 
easy to use and, because it needs only to determine the diagonal representation of 
relatively small matrices, is also rapid and niimerically stable. However, because 
the band limits correspond to Dirac functions, using the exact values obtained 
by this method gives usually diverging values for the PDOS near the band limits. 
Since the band width is underestimated by this method (due to the truncation 
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of the continuous fraction), this problem can be nicely solved by enlarging the 
calculated band by 1%. One way to check the efSciency of this method for the 
determination of the terminator function is to verify that the PDOS are correctly 
normalized which is usually the case with an error smaller that 10^®. 

For each level of the recursion calculation (2), the hamiltonian H is applied 
on the n-th element |n} of the new basis. This adds to the expression of |n + 1} 
all new “neighbours” linked through H of all sites included in the expression of 
n}. In other words, at each level of the recursion calculation, the contribution 
of the n-th shell of “neighbours” of the starting site |f, A) is taken into account. 
Consequently, the number of sites needed for calculating N exact levels of the 
continuous fraction (or N exact pairs of recursion coefEcients) is proportional to 
[5]. 



2.2 Tight Binding Hamiltonian 

In the Linear Combination of Atomic Orbitals method, the one-electron wave 
function jlF) is expressed as a linear combination of localized atomic orbital |f, A) 
on site i and spin-symmetry A: 

|>F) = ^q,aK,A). (13) 

i,X 

Solving Schroedinger equation becomes the eigen-value problem 
X^O,A((j,4t|-ff|c A) - e(j,^i|z, A)) = 0 

i,X 

=0. (14) 

i,X 

The overlap matrix plays an essential role when the atomic orbital can not 
be assumed as being orthogonal [6]. In this work, it is assumed that S is equal 
to identity The hamiltonian can then be directly expressed in 

the atomic orbital basis 

H= (15) 

which can be split into intrasite and intersite terms 



the on-site energy levels and /?X are the two-center hopping inte- 
grals linking sites i and j. In order to reduce the numbers of parameters of the 
hamiltonian, the on-site energy levels are assumed to be equal to 







I, 









^i,X,fi — 



2 



(17) 
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where ANij(^x'f and Ii^i(\) are respectively the spin independent 

energy level reference, the effective Coulomb integral, the charge variation and 
the effective Exchange integral of site i, is the local magnetic moment, 

l{\) and ux being respectively the orbital quantum number (corresponding to s, 
p or d states) and the spin of the spin-symmetry A. The charge variation ANi^px) 
and the local magnetic moment Mi^px) £ire obtained from the PDOS by 

ANi^i = '^6px),i [ n,^x{^)dt ~ Nil (18) 

A 

where Nh is the bulk I band occupation of site i and 

/ €f 

- 5cT^,-U,,,A(e))* (19) 

A 

Equations (17), (18) and (19) define the self consistency solution. The input 
charge variations and magnetic moments allow to calculate the input energy 
levels of the hamiltonian for which the output PDOS are determined and in- 
tegrated to obtain the output charge variations and magnetic moments. Self 
consistency is reached when input and output quantities do not more differ sig- 
nificantly. 

Usually, four energy levels e®, e®, and are needed for each site and the 

hopping integrals between each pair of sites are expressed in terms of ten 
simple Slater-Koster matrix elements [7] sscr, spa, sda, ppa, ppir, pda, pdir, dda, 
ddn and ddS. It has been shown that a good description of the band structure 
can be obtained by limiting the hopping integrals to nearest neighbours pairs of 
sites. Various sets of hopping integrals for a given element can be found in the 
literature showing that the Slater-Koster matrix elements do not have an unique 
value [8]. 

As an illustration, the density of states (DOS) obtained using the Slater- 
Koster parameters deduced by fitting ab initio band structure calculations by 
Papaconstantopoulos [9] and those obtained using the TB-LMTO method by 
Andersen et al [10] are compared. Fig. 1 shows the DOS obtained for bulk non 
magnetic Chromium using these two sets of parameters. The d bands arc very 
similar for both calculations. For the s and p DOS, if the bottom of the bands for 
both calculations are very similar, the DOS differ significantly for higher energies 
(larger than 15 eV). Due to the finite band width of tight binding DOS, the band 
structure shows non physical structures at high energies for all sets of parame- 
ters. However, with tight binding parameters taken from Papaconstantopoulos, 
the total band width {W) is approximately equal to 45 eV whereas with the 
second set of parameters W is approximately equal to 25 eV. Since the energy 
resolution of the DOS calculated with the recursion method is proportionnal 
to W divided by the number of levels, the larger IV is, the smaller the energy 
resolution of the DOS - for a same number of recursion levels - is. Consequently, 
the d DOS - which is the most important for itinerant magnetism properties - 
exhibits more fine structures when W is smaller (Fig. l.b) than for the other 
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Fig. 1. s, p, d and total (bold line ) densities of states obtained with 24 exact levels of the 
continuous fraction for bulk non magnetic Chromium with tight binding parameters 
taken from a. Papaconstantopoulos [9] and b. Andersen et al [10] 



calculation (Fig. l.a). It is then essential to choose tight binding parameters 
giving the smallest total band width in order to describe more precisely the d 
band of transition metals around the Fermi level. This is why, in the following, 
the tight binding parameters deduced by Andersen et al [10] are used. 

2.3 Clusters for the Recursion Method 

It has been shown previously that, for each additional level of the recursion 
fraction, the next shell of “neighbours” of the starting site |0} is taken into 
account. For the calculation of A^exact recursion levels, we have to built a cluster 
containing all sites which will contribute. This exact cluster correspond to all 
sites geometrically included in the A^exact-fh shell of “neighbours” . The shape of 
the exact cluster depends on the crystallographic structure and the cut-off Tc of 
the hopping integrals ( (3^’-^ = 0 when |ri — r^j > Tc) which can be approximated 
by a sphere centered on the starting site of radius R = R* = Alexact-?'c (this 
sphere contains the exact cluster). The number of sites A"site included in the 
sphere increases with A^exact like and the computation time increases in 

the same way. This is why, in most cases smaller clusters are used. However, the 
recursion coefficients are affected by the size and the shape of these “inexact” 
clusters. 

As an illustration, cubic and spherical “inexact” clusters are built and the 
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DOS obtained for various cluster sizes are compared. The minimal cubic cluster 
containing the exact cluster includes all sites i verifying \xi\ < a/ 2 , \yi\ < a/2. 
\zi\ < a/2 with a = a* = 2iVexact-fc- “Inexact” clusters are built with a smaller 
radius for spheres or a smaller edge for cubes. They are defined by the ratio 
p = R/R* for spherical clusters and p = a/ a* for cubic ones. 

Tables 1 and 2 present some characteristics of such “inexact” clusters and 
Figs. 2 and 3 display the DOS obtained for two of these examples. These results 
(Tables 1 and 2) shows that, as expected, for a similar number of sites in the 
clusters, the number of exact levels obtained during the calculation of 24 levels, is 
larger for a spherical cluster than for a cubic one. However, the comparison with 
the exact calculation (Fig. l.b) shows that the DOS (Figs. 2 and 3) obtained with 
spherical clusters exhibit more non physical peaks (mainly near the bottom of the 
DOS) than the others; these peaks are mostly found in the s band. This can be 
easily understood because the spherical cluster is a much better approximation 
of the exact cluster than a cube and consequently the missing sites correspond 
to complete shells whereas, for cubic cluster, they correspond to fractions of 
shells. During the calculation of successive levels, the missing sites have a more 
progressive impact when cubic clusters are used than with spherical ones for 
which the impact occurs abruptly at a given level. In this work, cubic clusters 
arc used, in order to keep the cubie symmetry of the considered crystals, with 
p = 0.25 and the DOS are determined with 24 levels in the continuous fraction. 
Of course, for other crystals, other “inexact” (non spherical) clusters have to be 
consider. 

The previous considerations on the cluster shape have been done for bulk 



Table 1. Some examples of “inexact” cubic clusters for various ratio p. The number 
of exact recursion levels and the computation time required for calculating 24 levels 
are displayed; the cluster built with p = 1 contains the exact cluster needed for the 
calculation of 24 exact recursion levels 



Linear scale 
ratio p 


Number 
of atoms 


Number of 
exact levels 


Computation 
time (s) 


0.167 


855 


5 


16 


0.25 


3925 


11 


38 


0.5 


29449 


24 a 


238 


1 


228241 


24 


6106 



^ differences between calculated and exact coefficients smaller than 10 



situation for which there is only one non eqTiivalent atom in the unit cell. For 
cells containing a large number of non equivalent sites, there are two possible 
ways to build an “inexact” cluster (Fig. 4): (i) building a set of cubic clusters 
centered on each non equivalent site or (ii) building an unique cluster by joining 
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Table 2. Some examples of “inexact” spherical clusters for various ratio p. The number 
of exact recursion levels and the computation time required for calculating 24 levels 
are displayed; the cluster built with p = 1 contains the exact cluster needed for the 
calculation of 24 exact recursion levels 



Linear scale 
ratio p 


Number 
of atoms 


Number of 
exact levels 


Computation 
time (s) 


0.195 


869 


6 


16 


0.325 


3942 


15 


40 


0.635 


29627 


24 a 


250 


1 


115633 


24 


1556 



^ differences between calculated and exact coefficients smaller than 10 





Fig. 2. s, p, d and total (bold line) densities of states obtained with 24 levels of the 
continuous fraction for bulk non magnetic Chromium for a. a cubic cluster with p = 
0.167 and 6 . a spherical cluster with p = 0.195; both clusters contain approximately 
850 sites 
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Fig. 3. s, p, d and total (bold line) densities of states obtained with 24 levels of the 
continuous fraction for bulk non magnetic Chromium for a. a cubic cluster with p = 0.25 
and b. a spherical cluster with p = 0.325; both clusters contain approximately 3900 
sites 



together all clusters built previously. In the first construction, if the unit cell is 
longer than a/2, all non equivalent sites are not included in one cluster (Fig. 4. a). 
This is problematic when interactions between distant atoms are studied because 
one of the atoms is outside the cluster and no direct interactions are taken into 
account. The second cluster construction solves this problem by including all 
atoms of the unit cell in the cluster. However, this long cluster breaks the cubic 
symmetry and the different sites are no more equivalent from the point of view 
of the recursion calculation. 

In order to have an idea of the fluctuations introduced by such a long cluster, 
the bulk antiferromagnetic Chromium situation is considered where all atoms are 
equivalent and the length n of the unit cell is artihcially increased. With exact 
clusters, the result does not depend on the size of the cell. The results obtained 
with the “inexact” cluster are presented on Table 3. The local magnetic moment 
fluctuates only very slightly (fluctuations from site to site smaller than 10^® /tb) 
but it decreases when the unit cell is increased. The on-site energy shows more 
pronounced fluctuations from site to site. This shows clearly that the size and 
the shape of the cluster play an essential role on the calculated properties when 
the unit cell is varied. This is exactly what is usually done for multilayers where 
the properties are studied as a function of the layer thickness. 
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a. 



b. 



1 1 
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Fig. 4. Schematic representa- 
tion of the two kind of clu- 
sters considered for long unit 
cells (dashed line) containing a 
large number of non equivalent 
sites (filled circles): a. a set of 
cubic clusters centered on each 
non equivalent site are built, b. 
an unique cluster containing all 
clusters of the (a) situation is 
built 



Table 3. Magnetic moment values (M) and energy on the sites i in the unit cell of 
length n which are not equivalent in the “inexact” unique cluster relative to the value 
obtained with a single atom in the unit cell AE{i) = E(i) — E{0, n = 1) 





M 

(mb) 


AE{0) 

(meV) 


AE{1) 

(meV) 


AE(2) 

(meV) 


n = 1 


0.6001 


0 






n = 2 


0.5868 


-0.52 






n = 3 


0.5796 


-0.50 


-0.69 




n = 4 


0.5743 


-0.68 


-0.72 




n = 5 


0.5731 


-0.75 


-0.75 


-0.75 


n = 6 


0.5727 


-0.76 


-0.76 


-0.76 


n = 11 


0.5725 


-0.76 


-0.76 


-0.76 



3 Periodic Versus Real Space Cells for Studying Bulk 
Magnetic Wall in Cr 

The study of the Interlayer Magnetic Couplings (IMC) in FemCr„ as a function of 
n requires the determination of the total energy from the electronic structure for 
various interlayer magnetic arrangement (IMA). Usually, they are restricted to 
collinear “ferromagnetic” (F) and “antiferromagnetic” (AF) IMA corresponding 
respectively to parallel and antiparallel magnetisation of successive Fe layers. 
The IMC are positive (respectively negative) as expected from the occurrence 
of a central magnetic defect in the Cr spacer when its thickness corresponds to 
an odd (even) number of atomic layers with an AF (F) interlayer arrangement 
[11]. For large spacer thickness, this defect becomes a bulk wall in the [001] 
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direction in a layered antiferromagnetic crystal. This bulk wall resembles to a 
Bloch wall in a ferromagnet separating two domains in the limit of a very strong 
anisotropy giving collinear magnetism. In the present case, the bulk wall in a 
antiferromagnet corresponds to an antiphase in the layered antiferromagnetic 
order. It is easy to build such magnetic configurations using periodic or real 
space cells. The periodic cell consists in a CriCr 2 „-i superlattice with an AF 
interlayer arrangement between the Cri layers (Fig. 5. a). The real space cell 
consists in a block of 2n — 1 atomic planes on which the wall is induced by a 
symmetry relative to the central atomic plane is applied (if this plane has a 
zero index, the symmetry corresponds to a magnetic moment on the i-th atomic 
plane given by Mi = —M^i), this block being surrounded by blocks for which 
the magnetic moments are frozen in a AF bulk-like configuration (Fig. 5.b). 



a 








CFj Cr2n-1 CFj ^*^1 

^ Symmetry 


m 

' V — 


imttHtijntnt 

1 

V 


itmm 

V 





AF frozen Real Space AF frozen 
block Cell block 



Fig. 5. Schematic representation of the cells used for calculating the bulk magnetic wall 
in AF Cr: a. a periodic cell built like a CriCr 2 n-i superlattice with an AF interlayer 
arrangement, b. a real space cell, containing 2n — 1 atomic planes with a symmetry ap- 
plied relative to the central atomic plane reversing the magnetic moments, surrounded 
by bulk-like frozen AF Cr blocks 



The calculation of the magnetic structure using a periodic cell is similar to the 
one of FemCr„ superlattices: the Fermi level Ep is given by the global neutrality 
requirement: 



( 20 ) 



With the real space cell, the situation is more complex because the two semi- 
infinite bulk-like frozen AF Cr blocks fix Ep to the bulk value. The local energy 
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levels €i^i are adjusted requiring the local neutrality: 

(21) 

i 

which is a reasonable approximation since the charge variations in transition 
metals are usually small. 

In this section, the study is restricted to a d description, because for collinear 
magnetism, the results obtained with this band structure description are more 
than qualitatively correct [12]. The characteristics (extent and energy) of the 
AF bulk wall can be determined with both cells using large thickness. In this 
paragraph, by increasing n, the two approaches (periodic and real space) are 
compared and the minimal value for n, for which the characteristics of the bulk 
wall do no more change significantly, is determined. With a periodic cell, the 
criterion is to recover the bulk value of the magnetic moment on the Cri atomic 
plane and, with the real space cell, it is to have a continuous behaviour at the 
frontier between the cell and the “frozen” blocks. Fig. 6 presents these values a 
function of n. The criterion is more rapidly satisfied with a real space cell than 
with a periodic one. Moreover, all magnetic moments are found equal to zero 
for periodic cells with n < 9 whereas with this value for n, 90 % of the bulk 
moment is reached with the real space cell. This result is not very surprising 
since, with the real space cell, the moment at the frontier between the cell and 
the “frozen” blocks is strongly maintained by the proximity of a frozen bulk 
magnetic moment on one side and only slightly reduced by the magnetic defect 
on the other side. For periodic cells, the magnetic defect cancels all moments 
in too small cells. This illustrates the efhciency of the use of real space cells for 
studying non interacting magnetic configurations. 

Using the real space approach, the characteristics of the bulk collinear ma- 
gnetic wall in Cr can now be investigated. The insert of Fig. 7 shows the magnetic 
moment profile of the wall: its extent is found approximately equal to 40 atomic 
planes. This shows that a frustration in the Cr AF order has repercussions over 
a large range of planes making this spacer particularly suited for studies with 
the present light approach since a large number of Cr atoms are concerned. Fi- 
nally, the asymptotic limit of the energy of this wall as a function of n seems to 
be jcr = 21 meV per in plane atom (Fig. 7). This energy corresponds to the 
energy of the interlayer couplings obtained for large Cr thickness demonstrating 
that the coupling energy does not decrease in the collinear restriction when the 
spacer thickness increases in such Fe/Cr superlattices. 

4 Non-Collinear Magnetism 

4.1 Continuous Fraction Expansion 
and Non-Collinear Magnetism 

All studies presented in the previous section have been realized in the collinear 
magnetism framework which saves computer time but represents a strong limi- 
tation as compared to experiments. The band structure non-collinear magnetism 
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Fig. 6. Magnetic moments values relative to the bulk value on the Cri atomic plane 
{filled squares) of the CriCr 2 n-i periodic cell and on the atomic plane at the frontier 
between the cell and the “frozen” blocks {open circles) 



allows to include the angular degree of freedom in the self consistent calculation 
[13,14]. The magnetic moment is a vector having 3 components My and 
Mz or can be described by its magnitude Mr and two spherical angles 0 and (j). 
First, the input hamiltonian expression has to be modified in order to take into 
account the varying local spin quantization axis Q (whose direction is given by 
the two usual spherical angles 9i and <pi) for each site i and, second, to determine 
the components Mg j and M 0 j of the output magnetic moment. 

The hamiltonian of (16) can be rewritten as a sum of a band i?band and an 
exchange Hexch hamiltonian where: 



and 



.f^band 



lb A)(e°A + A| 

i.X 



+ E 

(*.A) 



lb 0)7*1 




i/excb = E 

i,X 



.A)- 



-hM\)Mi,i(\) 



,A| 



cosOi 



e “^’sin 6 >A 
— COS0i J 



(22) 



(23) 



In these expressions, the spin part of the hamiltonian is represented by the 2x2 
matrix which is site dependent only in i/exch- Expression (23) is obtained by ap- 
plying a rotation on the cr^ Pauli matrix in order to align the local quantization 
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Fig. 7. Energy (in meV per in plane atom) of the bulk collinear magnetic wall as a 
function of the size n of the real space cell. The insert presents the magnetic moments 
profile of the wall 



axis Q with the global z axis. 

The PDOS on site i for the symmetry A is obtained with the recursion tech- 
nique by setting the starting element |0) of the recursion basis equal to \i, A). If 
the spin states are represented with the 2 : quantization axis by a two elements 
vector, we have: 

|f, A, c7a = -h) = |f, A') , |i, A ,cta = -) = |f, A') (24) 



where X' corresponds to the symmetry of the spin symmetry A. The PDOS on 
an arbitrary axis ^ defined by the two spherical angles (6>, (?) in the spin space 
is obtained by starting with the following initial recursion basis element: 



i,X,ax = +)^ 



i, A, fjA ) 



|i, A') 
|i, A') 



/ e ‘^/^cos® \ 
lAe-"‘^/2sinf ) 

f — e'^/^siny A 
y e“^/^cos^ J 



(25) 



For the determination of Mg^i and using (19), the majority and mi- 
nority spin states densities of states have to be calculated for all sites i and 





Combining Real Space and Tight Binding Methods 449 



symmetries A' for ( = Q aligned with (0 = Oi,^ = (pi), for C aligned with 
(0 = Oi + 7t/2, = (pi) and for C, aligned with j (0 = tt/2, = pi + tt/2). 

The first step of the non-collinear study has been done using a d restricted 




Fig. 8. Schematic representation of the slightly non-collinear magnetic configuration 
built by tilting the magnetic moment of one over two (001) atomic plane by a small 
angle 59 



band structure description. In most cases studied, a discontinuous behaviour 
between collinear and slightly non-collinear calculations has been found. For ex- 
ample, a slightly non-collinear magnetic configuration by tilting the magnetic 
moment directions of half the (001) atomic planes by a small angle 56 is built as 
shown by Fig. 8. An antiferromagnetic order like in Cr is obtained for 59 = tt. 
When 59 is varied starting from zero (ferromagnetic order) and increased pro- 
gressively, a nice continuous behaviour should be obtained. This is not obtained 
with a d band structure as shown by Fig. 9: the magnetic moment and the 
energy show rapid and large variations when 59 is varied from zero to 2°. On the 
contrary, the expected nice behaviour is obtained with a spd band structure as 
shown on the same figure. This peculiar behaviour comes from the not spin (+) 
and (— ) mixed d densities of states in the ferromagnetic collinear configuration 
{59 = 0): the (+) and (— ) d densities of states are determined completely inde- 
pendently from a calculation where all 2 x 2 matrices of (25) are diagonal. Since 
it is assumed that the tight binding parameters arc not spin dependent, the (+) 
and (— ) densities of states are the same (they have the same band width W and 
are only split in energy by h^dMi^d) but they have different band limits (Fig. 10) 
and the (+) and (— ) spin recursion coefficients a\i^ have different limits: 



a 



2,d,+ 

CX) 




i,d,+ I i,d,+ 

^min ' ^max 

— 




h,dMi^d 



(26) 



When, the magnetic configuration is non-collinear (even slightly non-collinear), 
the (+) and (— ) spin states are mixed and the (-I-) and ( — ) densities of states 
have the same band limits (this is not exactly the case in Fig. 10 because only 8 
recursion levels are used but larger band widths are obtained) and the recursion 
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012345 012345 

80 C) 50 (“) 

Fig. 9. Magnetic moments and energy variation as a function of the angle SO of the 
magnetic configuration represented in Fig. 8 obtained with a d (filled circles) and with 
a spd (open circles) band structure 



coefficients ,b]i^) stiould liave the same [a.]^ ^V^) limit: 



i.d,— 

®oo ~ “oo 



^OO ^oo I ^ 



This explains the large differences in the densities of states represented by Fig. 10. 
The most significant changes are mainly noticeable at the top of the majority 
spin band where a large peak occurs around one eV even for a very small 59 value. 
Such an unphysical behaviour is of course not obtained with an spd hamiltonian. 
In this case, the d band which carries the magnetism is hybridised with the s and 
p bands having a large band width and consequently the recursion coefficients 
limits are nearly insensitive to changes in the magnetic configuration. 

For non-collinear studies, a restricted d hamiltonian has to be used very 
carefully in order to avoid uiiphysical results related to numerical problems in 
the continuous fraction expansion. All these problems are nicely solved when a 
spd hamiltonian is used. 



4.2 Angular Dependence of the Interlayer Magnetic Couplings in 
Fe/Cr Multilayers 

The interlayer magnetic couplings discussed previously can now be studied in 
the non-collinear framework and their angular dependence can be investigated. 
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£ (eV) £ (eV) 

Fig. 10. Densities of states for bulk Fe obtained considering the magnetic configuration 
of Fig. 8 with 66 = 0° (ferromagnetic order) and 66 = 1° (slightly non-collinear or- 
der) . The majority (respectively minority) spin bands are represented by solid (dashed) 
lines 



This is done by fixing the angle A9 between the directions of the inner magnetic 
moments of successive ferromagnetic layers during the self-consistent calculation 
as displayed by Fig. 11. During the calculation, all not fixed magnetic moments 




Fig. 11. Schematic representation of the magnetic configuration considered for the cal- 
culation of the interlayer magnetic couplings. Each vector corresponds to the magnetic 
moment of all atoms in the (001) atomic plane which are equivalent 



are free to rotate and self-consistency is assumed to be achieved when the output 
perpendicular components Mg^i and on all these sites i are nearly equal to 
zero. In this paper, the angular variation is restricted to 6 in order to reduce the 
computer time but also because it has been checked that all magnetic moments 
vectors are in the plane defined by the two fixed magnetic moments. 

Because the d states are the most essential for the magnetism, the effective 
exchange integrals Is and Ip are usually equal to zero. However, if this is usually 
arbitrarily assumed, setting Ig = Ip = 0 is now required because we obtain self- 
consistency by an iterative way for non-collinear solutions. This comes from the 
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expression of the output magnetic moment in terms of s, p and d contributions 
in the input and spherical basis: 



M. 



(out) ^ A^(out) 






t,r,p ' t,r,d 



■M, 



(out) 



i,9,p 



■M, 



(out) 



1.9, d 



M 



(out) 



= M 



(out) (in) 



^(out) (in) 



(28) 



The magnetic moment used as input for the next iteration is obtained by varying 
9i in order to align the local quantization axis with the direction of the output 
magnetic moment given by: 



^(next in) _ ^(next in) (out) 



(29) 



in the output and spherical basis. The s, p and d decomposition of 

the next input magnetic moment has to be obtained: 



M, 



(next 



= M, 



(next 



M, 



(next 



t,r,p 



M, 



(next in) 



(30) 



which is equal to the magnitude of the output magnetic moment vector 




M. 



(out) 






(m); 



(out) 

e,s 



M. 



(out) 

i,9,p 



M, 



(out) s 

i,e,d ) 



(31) 



However, this s, p and d decomposition is lost when the magnitude of 
given by (31) is calculated. If Is = Ip = 0, the results are the same whatever 
the values of and M^Zp are (they do not contribute to the exchange field) 

and we have to do the self-consistent calculation only for the d component of 
the magnetic moments: 



. ,(next in) 




The calculation is assumed to be converged when 



Max.{|A<°J-M)“)|}<e 

MaxillM)^"*^]} <e 

<e' 



(32) 



(33) 



with e = 5 X 10^® /tb and e' = 10^® eV. 

Fig. 12 presents the interlayer magnetic couplings obtained with the d and 
the two spd tight binding parameters as a function of A9 for Fe 5 Cr 4 and FesCrs 
superlattices. Around the energy minimum, the coupling energy follows a para- 
bolic expression — for n = 4 and C^{A9)‘^ for n = 5 as predicted by 

a phenomenological model [15]. The couplings obtained with the d tight binding 
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a. FCjCr^ b. FCjCr^ 





Fig. 12. Interlayer magnetic couplings AE{A6) ~ E{tv or 0) obtained with the d {filled 
diamonds) and the spd tight binding parameters without {filled circles) and with {filled 
squares) a smaller interfacial Id as a function of A9 for Fc 5 Cr 4 and FesCrs superlattices. 
The open symbols correspond to the collinear solution 



parameters follow the parabolic function over the whole range of A9 considered. 
The collinear solution obtained for the frustrated configuration does not corre- 
spond to the solution obtained for A9 = 0 (n = 4) and tt (n = 5) and 
has a higher energy. This is not very surprising since, with these parameters, 
the Fe and Cr magnetic moments do not vary significantly when A9 varies and 
the local angles 9i vary nearly linearly with A9 as previously reported [16]. Such 
a nearly constant magnetic moment magnitude behaviour corresponds exactly 
to the phenomenological model which assumes a helical configuration in a Hei- 
senberg model for the antiferromagnetic spacer. The behaviour of the coupling 
energies for spd tight binding parameters is completely different: they follow the 
parabolic function only over half the range of A9 considered. This is particularly 
pronounced when the interfacial Cr Id is reduced from 0.96 eV to 0.90 eV (in 
order to have a better agreement with ah initio calculations) where the coupling 
energies show a maximum at A9 = 0 (n = 4) and tt (n = 5). Moreover, the 
frustrated collinear solution energies are nearly degenerate with the ones of the 
corresponding solution obtained with the non-collinear calculations. For exam- 
ple, for FesCr 4 superlattices, during the decrease of A9 from tt to 0 (i) the Fe 
magnetic moments have a nearly constant magnitude (ii) the magnitude of the 
magnetic moments on the Cr atoms decreases strongly when A9 reaches 0 and 






454 



Clara Cornea and Daniel Stoefller 



the magnetic moment of the central Cr atomic planes nearly vanishes, (iii) the 
local angles vary linearly when A6 decreases from 180° down to approximately 
60°; for smaller A6 values they vary very rapidly and reach values corresponding 
to the frustrated collinear magnetic conhguration for A9 = 0. 

This result explains qualitatively the different behaviours experimentally 
obtained for FeCo/Mn superlattices where the parabolic function applies [17] 
and for Fe/Cr super lattices where the saturation is better reproduced with a 
Ji.cos(Z\0) + J2.cos^(Z\0) expression for the coupling energy [18]. 



4.3 Step Induced Non-Collinear Magnetism 

In the previous paragraph, the non-collinear character is induced by the variation 
of A9. This is similar, for n = 4, to the situation where an increasing external 
magnetic held is applied on the multilayer. Non-collinear magnetic conhgura- 
tions can also be obtained when interfacial imperfections frustrate the natural 
magnetic order in the multilayer. 

This is illustrated by Fig. 13 for interfacial atomic steps. Because Cr is an- 




Fig. 13. Schematic representation of the frustration induced by an interfacial atomic 
step in an Fe/Cr/Fe sandwich: the Cr thickness tcr variation from 4 to 5 atomic planes 
(AP) a. splits the second Fe layer into domains of opposite magnetisation when the 
interfacial coupling is preserved or 6. induces a 90° interlayer arrangement when the 
interfacial coupling is partially frustrated 



tiferromagnetic, the sign of the interfacial Cr magnetic moment changes from 
one (001) atomic plane to the next. If the antiparallel interfacial Fc-Cr coupling 
and the Cr antiferromagnetic order (for small Cr thickness) are preserved, the 
interlayer magnetic coupling changes from AF to F when the Cr thickness tcr 
varies from n = 4 to n = 5 atomic planes at the atomic step (Fig. 13. a). 
The second Fe layer is then split into domains of opposite magnetisation and 
the domain walls correspond to the steps [19]. However, if we allow non-collinear 
magnetism, the second Fe layer can be nearly monodomain if its magnetisation is 
perpendicular to the one of the first Fe layer as shown by Fig. 13. b where we have 
assumed that only the interfacial Fe-Cr coupling is partially frustrated. This be- 
haviour, resulting from the competition between the strong Fe ferromagnetism 
and the fluctuations of the interlayer coupling, is usually invoked for explaining 
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the occurrence of 90° interlayer arrangements but does not result from an int- 
rinsic biquadratic interlayer coupling. The aim of this part is to use our method 
to determine explicitly the magnetic moment map for such configurations. 

For these calculations, all magnetic moments are free to rotate until self- 
consistency is achieved. However, because we do not include the spin-orbit cou- 
pling, a global rotation can be applied on all moments without changing the so- 
lution (there are no privileged direetions for the magnetisation). Consequently, 
when all moments are effectively free to rotate, the calculation can never con- 
verge because a global rotation can occur during each iteration and 0i can never 
reach its asymptotic value. Fixing arbitrary one moment (on site 0 for example) 
and allowing all others to rotate is not the best way to obtain the self-consistent 
map: the torque applied on the fixed site by all other sites is usually very large 
and a large number of iterations is needed in order to reduce significantly 
and to satisfy the convergence criteria (33) on all sites. A better way is to applied 
a global rotation at each iteration in order to keep the moment on a given site 
(the site 0) fixed in direction. The value for 9i at the next input is then 

^(next in) _ ^(out) _ ^(out) 



and the angular self-consistency is obtained when 



Maxi{|M) 



(out) 



0,d 






(out) 



i,r,d 



q(oUt) 



}< 6 . 



(35) 



In this case, the hnal map is obtained when the magnetic moments do no more 
rotate relatively each others even if they continue to rotate globally from one 
iteration to the next. 

Superlattices with atomic steps at one Fe/Cr interface like in Fig. 13 are mo- 
delled by periodic superlattices having a perfectly flat Fe layer separated from 
a rough second Fe layer by the Cr spacer with varying thickness. In the cases 
considered in this paper, i.e. atomic steps along the [010] direction with flat ter- 
races having all the same size of 5 atomic rows, the cell corresponds to the lateral 
juxtaposition of 5 Fes/Crs/Fes/Crs and 5 Fe 5 /Cr 4 /Fe 7 /Cr 4 cells (see Fig. 14). 
The total real space cell contains 110 non equivalent sites. The calculations have 
been done using the d restricted parameters and the two sets of spd parame- 
ters previously used. The magnetic moment map obtained with the d restricted 
parameters of Fig. 14 shows clearly that (i) the rough Fe layer is structured in 
domains of opposite magnetisation corresponding exactly to the terraces, (ii) 
only the Fe atoms at the border line of the domains have a local magnetic mo- 
ment perpendicular to the others, and (hi) the Cr spacer layer displays only a 
slight non-collinear character. This result does not correspond to the expected 
90° interlayer arrangement and presents a strongly reduced magnetisation of the 
‘rough’ Fe layer. On the contrary, with the spd parameters, the ‘rough’ Fe layer 
is (i) only slightly structured in magnetic domains having their magnetisation 
making an angle of approximately 80° and not 180°, (ii) the Cr spacer layer pre- 
sents a more pronounced non-collinear character, and (iii) the magnetisation of 
the ‘rough’ layer, which is only slightly reduced, is preferentially perpendicular 
to the one of the flat Fe layer. The two sets of spd parameters used in this work 
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Fig. 14. Magnetic monicnt maps of a Fe. 5 . 6 Cr 4.6 supcrlattice having a perfectly flat 5 
atomic planes thick Fe layer {bottom layer) and a rough Fe layer whose thickness varies 
from 5 to 7 atomic planes {top layer) separaterl by a Cr spacer layer whose thicknes.s 
varies from 4 to 5 atomic planes obtained with the d restricted parameters (upper map) 
and with the spd parameters and the reduced Cr interfacial Id (lower map). The arrow 
gives the direction of the local magnetic moment whose magnitude is given. The Fe 
sites have a grey arrow head and the Cr sites have a black one and a grey background 
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give nearly the same result. This behaviour corresponds more to the expected 
one. 



5 Summary 

It has been shown that real space methods like the recursion technique allow to 
study the magnetic properties of complex systems in a much larger variety of 
configurations than most of the other approaches. However, the use of a tight 
binding description of the band structure limits the confidence in the results 
and it has been exhibited that d restricted and spd parameters give significantly 
different results when non-collinear magnetic solutions are allowed. This is why, 
more accurate band structure description consistent with a real space approach 
are needed and the TB-LMTO method is a possible candidate. 
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